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26 March 2001 (26.03.01) 
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2. The election | X| was 

I I was not 

made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 
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PATENT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCX Article 18 and Rules 43 and 44) 



Applicant's or agenf s file reference 
18134/8X109 


FOR FURTHER Notification of Transmittal of international Search Report 
^ , (Form PCT/ISA/220) as well as, where applicable, item 5 below. 
ACTION 


International application No. 

PCT/NZ 00/00174 


International filing date (day/month/year) 

04/09/2000 


(Earliest) Priority Date (day/month/year) 

02/09/1999 


Applicant 

NEW ZEALAND PASTORAL AGRICULTURE RESEARCH INSTI... 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 

This International Search Report consists of a total of 6 sheets. 

[X] It is also accompanied by a copy of each prior art document cited in this report. 



1 . Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the International application in the 
language in which it was filed, unless otherwise indicated under this item. 



□ 



the international search was carried out on the basis of a translation of the international application fumished to this 
Authority (Rule 23.1(b)). 



b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : 

pr| contained in the international application in written form. 
I I filed together with the international application in computer readable form, 
furnished subsequently to this Authority In written form, 
furnished subsequently to this Authority in computer readble form. 



□ 

m 
ffl 

m 



the statement that the subsequently furnished written sequence listing does not go beyond the disclosure In the 
international application as filed has been furnished. 

the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 



2. 
3. 



Pn Certain claims were found unsearchable (See Box I). 
I I Unity of Invention Is lacking (see Box 11). 



4. With regard to the title, 

I I the text is approved as submitted by the applicant. 
pr| the text has been established by this Authority to read as follows: 
NUCLEOTIDES SEQUENCES ENCODING AN INSECTIDAL PROTEIN COMPLEX FROM SERRATIA 



5. With regard to the abstract, 

pr| the text is approved as submitted by the applicant. 

I I the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
' — ' within one month from the date of mailing of this international search report, submit comments to this Authority. 

6. The figure of the drawings to be published witti the abstract is Figure No. 



I I as suggested by the applicant. \T\ None of the figures. 

I I because the applicant failed to suggest a figure. 

I I because this figure better characterizes the invention. 



Form PCT/ISA/210 (first sheet) (July 1998) 



# 

International Application No. PCT/NZ 00 ^0174 



FURTHER INFORMATION CONTINUED FROM POT/ISA/ 210 



Continuation of Box 1.2 
Claims Nos. : 17 



Present claim 17 relates to a ligand defined by reference to a desirable 
characteristic or property, namely binding to the polypeptide of claim 
15. 



The claim covers all products having this characteristic or property, 
whereas the application provides no support within the meaning of Article 
6 PCT and/or disclosure within the meaning of Article 5 PCT for such 
products. 

In the present case, the claims so lack support, and the application so 
lacks disclosure, that a meaningful search over the whole of the claimed 
scope is impossible. Independent of the above reasoning, the claims also 
lack clarity (Article 6 PCT). An attempt is made to define the product by 
reference to a result to be achieved. 

Again, this lack of clarity in the present case Is such as to render a 
meaningful search over the whole of the claimed scope impossible. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



INTERNATIONAL SEARCH REPORT 



A. CLASSIRCATION OF SUBJECT MATTER*^ ~~ 

IPC 7 C12N15/31 CIZnJI^O C12N15/82 C07K14/24 W12Q1/68 
A01N63/02 A01H5/00 

According to International Patent Classification (IPC) or to both national classification and IPC 



International Application No 

PCJ/ta 00/00174 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by dasslfication symbols) 

IPC 7 C12N C07K AOIH C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included In the fields searched 



Electronic data base consulted during the international search (name of data base and. where practical, search terms used) 

BIOSIS, EPO-Internal , STRAND, WPI Data, PAJ 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



JACKSON T A ET AL: "PATHOGEN TO PRODUCT 
DEVELOPMENT OF SERRATIA-ENTOMOPHILA 
ENTEROBACTERIACEAE AS A COMMERCIAL 
BIOLOGICAL CONTROL AGENT FOR NEW ZEALAND 
GRASS GRUB COSTELYTRA-ZEALANDICA" 
JACKSON, T. A. AND T. R. GLARE (ED.). USE 
OF PATHOGENS IN SCARAB PEST, 

1992, pages 191-198, XP000997900 
0-946707-35-9. 1992 
the whole document 

-/-- 



32 



[ X I further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



^ Special categories of cited documents : 

'A' document defining the general state of the art which is not 
considered to be of particular relevance 

*E' earlier document but published on or after the international 
filing date 

'L' document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

•Q' document referring to an oral disclosure, use. exhibition or 
other means 

P' document published prior to the international filing date but 
later than the priority date claimed 



•T* later document published after the international filing date 
or pnority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 

invention 

*X' document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

'Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&■ document member of the same patent family 



Date of the actual completk>n of the international search 

23 May 2001 


Date of mailing of the internatk>nal search report 

06/06/2001 


Name and mailing address of the ISA 

European Patent Off tee. P.B. 5818 Patent laan 2 
NL-2280HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Holtorf, S 
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INTERNATIONAL SEARCH REPORT 



C.(Continuation) DOCUMENTS CONSIDEREI 



Category Citation of document, with indication.where appropriate, of the relevant passages 



International Application No 

PCT/MZ 00/00174 



RELEVANT 



Relevant to cl£um No. 



GRKOVIC STEVE ET AL: "Genes Essential for 

Amber Diseae in Grass Grubs Are Located on 

the Large Plasmid Found in Serratia 

entomophila and Serratia proteamaculans. " 

APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 

vol. 61, no. 6, 1995, pages 2218-2223, 

XP000994573 

ISSN: 0099-2240 

cited in the application 

the whole document 

GLARE TRAVIS R ET AL: "Plasmid transfer 

among several members of the family 

Enterobacteri aceae increases the number of 

species capable of causing experimental 

amber disease in grass grub." 

FEMS MICROBIOLOGY LETTERS, 

vol. 139, no. 2-3, 1996, pages 117-120, 

XP000998482 

ISSN: 0378-1097 

cited in the application 

the whole document 

WO 99 42589 A (NOVARTIS ERFIND VERWALT 
GMBH ;NOVARTIS AG (CH); KRAMER VANCE CARY) 
26 August 1999 (1999-08-26) 
the whole document 

WO 98 08932 A (DOW AGROSCIENCES LLC 
;WISCONSIN ALUMNI RES FOUND (US)) 
5 March 1998 (1998-03-05) 
the whole document 

WO 98 08388 A (MORGAN JAMES ALUN WYNNE 
;JARRETT PAUL (6B); ELLIS DEBORAH JUNE 
(GB) 5 March 1998 (1998-03-05) 
the whole document 

WO 97 17432 A (WISCONSIN ALUMNI RES FOUND) 
15 May 1997 (1997-05-15) 
the whole document 

BOWEN D ET AL: "INSECTICIDAL TOXINS FROM 

THE BACTERIUM Photorhabdus liminescens" 

SCIENCE, AMERICAN ASSOCIATION FOR THE 

ADVANCEMENT OF SCIENCE,, US, 

vol. 280, 26 June 1998 (1998-06-26), pages 

2129-2132, XP002115650 

ISSN: 0036-8075 

cited in the application 

-/-- 
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International Application No 

PCJjjg^ 00/00174 


C.(Continuation) DOCUMENTS CONSIDEREHI^P^E RELEVANT 


Category * 


Citation of document, witti indication.where appropriate, of tfie relevant passages 


Relevant to claim No. 


A 

P,X 


NUNEZ-VALDEZ M E ET AL: "The ambZ locus 
from Serratia entomophila confers 
ant 1 -feeding effect on larvae of 
Costelytra zealandica (Coleoptera: 
Scarabaeidae)" 

GENE, NL, ELSEVIER BIOMEDICAL PRESS. 

AMSTERDAM, 

vol . 172, no. 1, 

12 June 1996 (1996-06-12), pages 75-79, 

XP004042712 

ISSN: 0378-1119 

cited in the application 

HURST MARK R H ET AL: "PI asmid-located 
pathogenicity determinants of Serratia 
entomophila, the causal agent of amber 
disease of grass grub, show similarity to 
the insecticidal toxins of Photorhabdus 
luminescens. " 
JOURNAL OF BACTERIOLOGY, 
vol . 182, no. 18, 

September 2000 (2000-09), pages 5127-5138, 
XP002166799 
ISSN: 0021-9193 
the whole document 


1-4. 
9-16, 
21-27, 
31,41 



Form PCT/ISA/210 (continuation of second sheet) (July 1992) 



page 3 of 3 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



Patent document 
cited In search report 



blication 
date 



International Application No 

PCJJUl 00/00174 



Patent family 
member(s) 



Publication 
date 



WO 9942589 



26-08-1999 



AU 
EP 



3028699 A 
1054972 A 



06-09-1999 
29-11-2000 



WO 9808932 A 05-03-1998 



AU 


729228 


B 


25- 


-01- 


■2001 


AU 


1050997 


A 


29- 


-05- 


-1997 


AU 


2829997 


A 


19- 


-03- 


-1998 


BR 


9606889 


A 


28- 


-10- 


-1997 


BR 


9711441 


A 


24- 


-10- 


■2000 


CA 


2209659 


A 


15- 


-05- 


■1997 


EP 


0797659 


A 


01- 


-10- 


-1997 


EP 


0970185 


A 


12- 


-01- 


-2000 


HU 


9900768 


A 


28- 


-06- 


-1999 


JP 


2000515024 


T 


14- 


-11- 


-2000 


PL 


321212 


A 


24- 


-11- 


-1997 


PL 


332033 


A 


16- 


-08- 


-1999 


SK 


24699 


A 


10- 


-04- 


•2000 


SK 


93197 


A 


06- 


-05- 


■1998 


TR 


9901126 


T 


21- 


-07- 


■1999 


WO 


9717432 


A 


15- 


-05- 


■1997 



WO 


9808388 


A 


05-03-1998 


AU 


4024997 


A 


19-03-1998 










BR 


9711285 


A 


17-08-1999 










CN 


1233938 


A 


03-11-1999 










EP 


0923295 


A 


23-06-1999 










TR 


9900435 


T 


21-06-1999 










ZA 


9707373 


A 


15-02-1999 


WO 


9717432 


A 


15-05-1997 


AU 


729228 


B 


25-01-2001 










AU 


1050997 


A 


29-05-1997 










BR 


9606889 


A 


28-10-1997 










CA 


2209659 


A 


15-05-1997 










EP 


0797659 


A 


01-10-1997 










HU 


9900768 


A 


28-06-1999 










PL 


321212 


A 


24-11-1997 










PL 


332033 


A 


16-08-1999 










SK 


93197 


A 


06-05-1998 










AU 


2829997 


A 


19-03-1998 










BR 


9711441 


A 


24-10-2000 










EP 


0970185 


A 


12-01-2000 










JP 


2000515024 


T 


14-11-2000 










SK 


24699 


A 


10-04-2000 










TR 


9901126 


T 


21-07-1999 










WO 


9808932 


A 


05-03-1998 
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To: 

kES& WELLS 
Private Bag 3140 
HAMILTON 
New Zealand 




PCX 

NOTIFrCATION OF TRANSMITTAL OF 
INTERNATIONAL PRELIMINARY EXAMINATION 
REPORT 

(PCT Rule 71.1) 






Date of mailing 
day/month/year 


1 DEC 2001 


Applicant's or agenfs file reference 
18134/8X109SB 


IMPORTANT NOTIFICATION 


International Aji^lication No. 
PCT/N200y00174 


International Filing Date 
4 September 2000 


Priority Date 

2 September 1999 


Applicant 

NEW ZEALAND PASTORAL AQRICmTUI^RESEARC3ID^^ ctal 



The applicant is hereby notified that this International Preliminary Examining Authority transmits herewi^ the 
inteniational prelixmnaxy examination report and its annexes, if any, established on the international application. 

A copy of the report and its annexes, if any, is being transmitted to die Internationa] Bureau for communication to all the 
elected Offices. 

Where required by any of the elected OfRces, the International Bureau -will prep^ an English translation of the report 
(but not of any annexes) and will transmit such translations to ^ose OfHces, 

REMINDER 

The applicant must enter the national phase before each elected OfiSce by perforaiing certain acts (filing translations and 
paying national fees) within 30 months from the priority date (or later in some Ofnces)(Articlc 39(l))(see also the 
rexninder sent by the International Bureau with Form PCr/IB/30 2). 

Where a transiatioh of tiie international application must be furnished to an elected OfHce, that translation must contain a 
translation of any annexes to the international preliminaxy examixiation report. It is the applicants responsibility to 
prepare and fiamish such translation directly to each elected Office concerned. 

For further details on the applicable time limits and requirements of the elected Offices, see Voltmie II of the PCT 
Applicant's Guide 



Name and mailing address of the IPEA/AU 


Authorized o£Qcer 


AUSTRALIAN PATENT OFFICE 




PO BOX 200, WODEN ACT 2606. AUSTRALIA 




E-mail address: pct@ipaustTalia^gov.au 


GARETHCOOK 


Facsimile No. (02) 6285 3929 




Telephone No. (02) 6283 2541 



FormPCT/IPEA/416 (July 1992) 
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NT COOPERATION TREATY 

PCX 

INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) 



) 

Applicant's or agent's fi]c reference 
18134/8X109SB 


FOR FURTHER See Notxficadon of Transizxinal of IntexnadcQULl Prelizninaiy 
ACTION Examination Report (Form PCT/IPEA/4 16). 


International Application No, 
PCT/NZOO/00174 


International Filing Date (day/month/year) Priority Date (day/month/year) 
4 September 2000 2 September 1999 


International Patent Classification (IPC) or national classification and IPC 

Int. CI. ' C12N 15/31, AOlH 5/00, AOIN 63/02. C07K 14/24, C12N 15/70, C12N 15/82, C12Q 1/68 


Applicant 

NEW ZEALAND PASTORAL AGRICULTURE etal 



1 » This international preliminary examination report has been prq>ared by ftis Intemational Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REI^ORT consists of a total of 3 sheets, including this cover sheet 

j X| Tbis report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which Iiavc 

been amended and are the basis for this report and/or sheets containing rectifications made before this Authority (see 
Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of 5 sheet(5). 



3, This report contains indications relating to the following items; 



1 


m 


n 


□ 


in 


□ 


IV 


□ 


V 




VI 


□ 


VII 


□ 


vm 


□ 



Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



Date of submission of the demand 
26 March 2001 


Date of completion of the report 
28 November 2001 


Name and mailing address of the IPEA/AU 

AUSTRALIAK PATENT OFHCE 
PO BOX 200, WODEN ACT 2606, AUSTRALIA 
E-mail address: pct@ipaustraHa.gov.au 
Facsimile No. (02) 6285 3929 


Authorized Officer 

GARETH COOK 
Telephone No. (02) 6283 254 1 



Form PCT/IPEA/409 (Cover sheet) (July 1998) 
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I. 



INTERNATIONAL PRELIMIN. 



ATION REPORT 



ionkl application No. 
PCT^^ZOO/00174 



Basis of the report 



With regard to the elements of the international application:* 
I I the international application as originally filed. 

the description, pages 1-2,5-48, as originally filed, 
pages 



X the claims, 



[x1 ttic drawings. 



, filed with the demand, 
pages 3,4, received on 5 October 2001 with the letter of 5 October 2001 
pages 52-54, as originally filed, 

pages , as amended (together with any statement) under Article 19, 
pages , filed with ^ demand^ 

pages 49-51, received on 23 November 2001 with the letter of 23 November 2001 
pages 1-8, as originally filed, 
pages , filed with the demand, 
pages , received on with the letter of 



\ ^X^ the sequence listing part of the description: 

pages 1-46, as originally filed 
pages , filed with the demand 
f p^ges , received on with the letter of 

Widi regard to the language, all the elements marked above were available or furnished to this Authority in the language in 
which the international application was filed, unless otherwise indicated under this item. 
These elements were avaDablc or furnished to this Authority in the following language which is: 
[ ] the language of a translation furnished for the purposes of international search (under Rule 23. 1 (b)). 

[~| the language of publication of the international application (under Rule 48.3(b)), 

j I the language of the translation furnished for tfie purposes of international preliminary examination (under Rules 55.2 
and/or 55,3). 

With regard to any nucleotide and/or amino acid sequence disclosed in the intexnational application, the international 
preliminary examination was carried out on the basis of the sequence listing: 
[^1 contained in the international application in written form. 

[~ [ filed together with the international application in computer readable form. 
[ J furnished subsequently to this Authority in written fonn. 
|x] furnished subsequently to this Authority in computer readable form- 
ic The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 

international application as filed has been furnished. 
[x] The statement that the information recorded in computer readable form is identical to the written sequence listing has 

becnfurmshcd 
[~j The amendments have resulted in die cancellation of: 

I I the description, pages 

I I the claims, Nos. 

[~] the'drawings, sheets/fig. 

I I This report has been established as if (some of) the amendments had not been made, since they have been considered to 
go beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2(c)),** ^ 

Replacement sheeu Mfhich have been furnished to the receiving Office in response to an invisation under Article !4 are referred to in this 
report as "originally filed'' and are not annexed to this report since they do not contain amendments (Rules 70. 16 and 70. 1 7). 

Any replacement sheet containing such amendments must be referred to under item I and annexed to this report 



Form PCT/IPEA/409 (Box I) (July 1998) 
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INTERNATIONAL PRELIMIN.^^XAMINATION REPORT 



P|tionaI application No. 



h- r 

1. Statement 



V. Reasoned statement under Article 35(2) with regard to noveIt>', inventive step or industrial appUcabiUty; citations 

and explanations supporting such statement 



Novelty (N) 


Claims 


1-41 


YES 




Claims 




NO 


Inventive step (IS) 


Claims 


J-41 


YES 




Claims 




NO 


Industiial applicability (lA) 


Claims 


1-41 


YES 




Claims 




NO 



2. Citations and e3q)ianations (Rule 70.7) 



Noveltv CN) and Inventive Step (I S\ claims 1 to 41 

The claims are to a an insecticidal protein complex, the polynucleotide encoding it and uses associated with 
protein ahd polypeptide. The closest prior art is considered to be Grkovic S et al. Applied and Enyironmental 
Microbiology, 1995, 61 (6):221 8-223 which discloses the plasmid which encodes for the insecticidal protein 
complex. The docxmient however does not disclose the sequence of the polypeptides or their encoding 
polynucleotides, hence the claims are considered novel. It is also considered that it would require more than 
routine effort to identify the genes encoding the protein complex, hence the claims are considered to involve 
an inventive step. As such the claims meet the requirements of Articles 33(2) and 33(3) of the PCT. 

Industrial Applicability (lA) claims 1 to 41 

Claims 1 to 41 are considered to be Industrially Applicable under Article 33(4) of the PCT, 
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ENT COOPERATION TREATY 



From the INTERNATIONAL SEARCHING AUTHORITY 



PCT 



To: 

SIMS, ALLEN, FINCH, LEWIS, MURPHY, 

ROGERS, TYRER-HARDINS, WELLS, 

29 Clarence Street 

Private Bag 3140 

Hamilton 2001, New Zealand 

NEW ZEALAND 


NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPOFIT 
OR THE DECLARATION 

(PCT Rule 44.1) 


Date of mailing 

<d6y/month/^ear) 06/06/2001 ' 


App&canrs or agent's Hie reference 
18134/8X109 


FOR FURTHER ACTION See paragraphs 1 and 4 below 


iniamatfonal appHcalion No. 
PCT/NZ 00/00174 


Intfimaitonal fifing date 
(^y/monfh/yBar) 04/09/2000 


Applicant 

NEW ZEALAND PASTORAL AGRICULTURE RESEARCH INSTI... 



1 . The applicant Is hereby notftled mat the Internalional Search Report has been established and is transmitted horowJih. 
Filing of amendments and atatamant undar Artfcta 19: 

The applicant is entitled, if he so wishes, to amend the cJaime of the intemational ApplfcatUsn (see Rule 46): 

Whan? The time limit for filing such amendments Is nomialty 2 monih$ from the date of cranfimittal o< me 
Intemational Search Report however, for more details, see iha notes on the aocompanying sheeL 

Whera? Directly to the International Bureau of WlPO 
34» chemin das Colombettss 
1211 Geneva 20, Switzerland 
Fascimlle No.: <41-22) 740.14^ 

For mora detailed inslnietions, see the notes on ihe accompanying eheet. 

2. I I The applicant is hereby notified that no International Search Report wUI be establiahed and that the dadaraUon under 
^ Article t7(2)(a) to that effect Is transmitted herewith. 



3, With regard to the protest against payment of (an) additional fieB(s] under Rule 40^, the applicant is notified that 

□ the protest together with the decision thereon has k>een transmitted to the Intemational Bureau together with the 
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices. 

I I no decision has been made yet on the protest; the applicant will be notified as soon as a decision Is made. 

4. Further aetion(a): The applicant is reminded of the following: 

$hon)y attar 16 months from the priority date, the Internationa) application will be published by the Intemational Bureau. 
K the applicant wishes to avoid or postpone publication, a notice of withdrawal of the mternationaf application, or of the 
priority claim, must reach the Internationa! Bureau as provided in Rules 90bf8.1 and 90b/s.3, reepeetivety, before the 
completion of the technical preparations for intemational publication. 

Within 19 months from the priority date, a demand for interrmtfonal preliminary examination must be filed If the applicant 
wishes to postpone the entry Into the national phase until 30 months from the priority dale (in some Offices even later). 

Within 20 montha from the priority date, the applicant must perform the prescribed acts for entry into the national phase 
before all designated Offices which have not been elected In the demand or in a later election within 19 months from the 
priority data or could not t>e elected because they are not bound by Chapter 11. 



fMame and mailing address of the international Searching Autiiority 

European Patent Office. P.B, 6818 Patentlaan 2 
NL-2280 HV ROswijk 

Tel. (-I-31-70) 340-2040, Tx. 31 651 epo nL, 
Faxr (+31-70) 340-3016 



J 



Authorized officer 

Mirellle Claudepierre 
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FORM PCT/ISA/220 



Th«i« Noto» ar« intend^l to giv» the haaio in«lruct»on» conommtng the filffjo of am^ndnwto ""^.■f*?5J^i^«^i«.^ 
Not«» ba»ed on th» r.quif«t»fitm of th* P«t«H Coop.wtio»i Ti»«ly. Hhi R^l^^f f?"*."!* ^'^JS^JSJ^Hl^^^ 
under tha! Tro^. In c«w of dtoompancy botwMn theae NoIm and thoM rec^lwmwib, fho lattor are opplioabla. For more 
dotaiM jnformatktn. aao also tha PCT Applioahl% Giida, a pubKeaiton of WiPO. 

m lhaaa Nok«», -Artfde^ -Rula', and *Sacdon- rafar lo theproviaiona of tha PCT. tha PCT Ragulationa and tha PCT 
Admrniatrativa Inatnictiona raapac^y. 



INSTRUCTIONS CONCERNiNG AMENDMENTS UNDER ARTICLE 19 

Tha applioant haa, aW having racah^ad Iha ir*»fr>«iiori^ 
inlamationirf applferfon. R ahould howavar ba amphaaizad thai, ainoa all parta of tha inlamaiionBl W»»atoon (o^-nia, 
daaortption anddrawing*) may ba amoncM during tha inlamalJonal prafunimwy examinaaJpn f^foj^-^T^^ "^l^ 
no naadto IHa amandmanta of the daima undar Aitkslo 19 axoapt whara. eg. the afipTicant warita ttw l»ttaf to ba P^^^^;^ 
for tha purpoaaa of proviHona! protocltan or haa another raaaon tor amanding the dalma batora intamabonai pbulioMton. 
Furthannoro, ft ahould ba amphaatoad mat piowtaw^ 

WM paiU o< tha IfilafMtkiiial appPcadon may ba a^ 

Undar Aiiida 19, only tha olaima may ba amanded. 

Ourtnglhalntam^ionalpfwa.tfiaolarma may ato bo amandad (or further amended) under ArtWe 34 batera 
lha tntam^km^ F^Hmlnary Examinino Authority. Tha deaoriptbn and drawinga may only ba amended un«far 
AriSde 34 faafoia tha Intafnattonal Examining Authority. 

f Upon entry into the nitfonal phaaa, aH parta of lha iftentfkmal applioalien may ba amamted undar Artida 28 
or, whata appMoabla, Artkto 41. 

Whan? Wfthin 2 montha f^ the data of tranamiltal of th« inlBmational aaardi report or 1 6 montha from the priority 

data, whlch^var lima ttirat axpvaa latar. It ahould ba notad» howavar, that the amandmenta ariU ba oonwderad 
a» having bean foeaiwad on tima if lhay are raeaivad by lha Intamatkmal Buraau aftar tha ajtpir^on of tha 
qppifgffH^ lima lifflft but balbra lha conflation of tha technical proparstionB for intaitialional publioation 

(ftuio4e.i). 



Whara noltoIHa tha i 



ta7 



Offioaorthe 



The amerKlmenla may only ba f^ad ^fcnth the Intamfliionat Bureau and n^ wfth tha raoaivirH9 
Inlamaiionet Saaiching Authority (Hula 46J29. 

Whara a demand Ibr inlamdkmal praSmlnary axaminalion haa baan^a lIMp aae baloar. 



. Btliar by oanoalfmg one or inM antira fliaima, by adding one or rnofo nea^ 
one or mora of Ilia dalma i 



A raplaoement aheet miAl be aufamttted for each ehael of the dalma whidi. on account of an amandmart or 
amandmanta, <Sffara from the ahaat originaRy filed. 

Atf tha dakne ^spearing on a raplacBmard sheet muat ba numbered in Arabic nuinarab. 
aanoallad,nonmumbarino of tha olhar daima iaraqurad. In all caaaa whera daima are ranumbarad^thaymuat 
be ranumbered cone»outivaly (Adminidrativa Inalrudiona, S ed fangOS<b)>. 



Tha 



muat ba iniida in tha 



laga bi whldi thaintomsftloml application la to ba pubilahad. 



What docitmettta tntiatmiay aeoompany tha amandmanta? 
Uttar (Section 206(b}): 

Tha amandmaiifta mud tiaaubiTnttad with alaltar. 

The Mtor will not ba pubKahed with the intemationd apptlcoiion and the amended daima. I! ahould not ba 
oonf iMd with the 'Staiemerrt under Artnle 1 9(1 y (aae belcMv, under "Statement under Artide 1 9(1 )*). 

Tha N«tar muat ba tn Engllah or rrondl, at tha ohoioa of tha appHoant However, If tha lansu^ga c« tha 
Intamotlond appUoaUon la EngUah^ lha laoar muat ba In EngUah; H tha tanguaoe of tha intamailonal appHeatlon 
la Pranelv tha lattv muat be In Hmdi. 



MoCaa to Form PCT/I&A/220 (Ifrd aheal) (January 1994) 

eNSOOCIO: «X3ISA220NOENP4J.> 
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S TO FORM pCT/tSA/220 (continued 



Thd Mtfff inu»t iiK^calD th« diff«f»noM between the daimt as flM and lh« oimimm as nmendfid. must, in 
p«ftioutv, iiKSe^tt, in connaotion with eaeh datm app^anng in the international applieakion fit being undeiatood 
that adeniieal indloations eonraming several olaims may be grauped),wfiether 

(i) the eiajm » unchanged; 

(ff) the olaim it oanoelled; 

(m) the olaim b namr; 

fiv) the daim Mplaoea ona or mora daima at filed; 

M thtt claim ia thermit of the. dh^bicm of a elaim OB (ilad. 



The following enmplea Muatrata the avanner In wfilch wnandnianta must ho oxplained In the 



1. (Whare orlQlruUly tharo were 48 daims and aftor amandment of some ^aima there are 51]: 

Xldme t to 29 1 31 9 32, 34. 35, 37 ftp 48 roplaoed by amended daima bearing the eame numbeta; 
dobfto 30. 33 and 36 unohangod; new daima 48 to 51 added.* 

It fWhofo ongir»a8ythQnMeafa ISetaimaayidaftaf a ma r wi fiiantof aB olaffnathertearie t1|! 
H:idma 1 to 1 5 loplaoed by amended ddnta 1 to 1 1 .* 

a (Wharo Qitginallytherower»14damiearidttMamandmer^ 
naweiaima]: 

*Cldfna 1 to 6 and 14 unchangoc^ daima 7 to 13 oanoelad; now daima 15, 16 

"Claima 7 to 1 3 eanoeHod; now olaims 1 5, 16 and 1 7 added; al other ddma unchanged." 

^ 4. |Wherovartou> landB of ai i ia iKl ri i oi ita afo modal: 

*CU^ 1-10 undiangod; elaima 11 to 13, 13 flfid 19 oanooled; olamia 14. 15 and 16 replwed fay a^ 
doim 1 4; claan 17 Bubdhrided into amandod ddm 16. 1 6 and 17; naw 

-SCatamont under arttcki 19(1)" (two 46^) 

Ihm am^ndrnenla may be aocompanted by a otstamant explainang the amendments and indioaiing any onpact 
that Budi amendments might h^^ on the daecHption and the drawings (whidt cannot be amended under 
ArtK^l9(1)). 

The slslement wtU faa putaliahed with the international appSoctfidn and the amended daima. 
ft niiat bo In Iho foiHN^i^ N Which Iho Mamailonal opppNcsl^ 
It must t3o briofp not oxoeoding 500 words if in EngOih or if IranaldM 

tt should not be confused with and does net raplafietf^ letter indicating tha tif is i a i i o ea betwe e n the dalme 
as filed and as amended. It must be Red on a separate shed end fnuat bo identified «B audi by a tieadint^ 
pmierafaly by uaingtho words "StMementunder Aitide 19(1).* 

II may not oo niam any tfsparaging oomn w t s on the Intecnational aearoh report or tlie rotevonoe of oitatieni 
ooritainedlnthdnBpoft Rofersnoelodtations, relevant to a given daim, contained in the intomottonalsearoh 
lopoft mey be mado only in conneolion wSh an amondmont of that daim. 

Conaaquanc* If a demand for lntet«M«|oftof pivltml^^ 

lf,atthetimeof fiRng any amendments under Artida Id. a demand for international prdMnary examination 
has almady been submitted, the applicant must prsferably. d the same time of fting the amenc^ents with the 
MemitionBl Buroau. also fie a copy of such amendments with the Intemdional Prelimtnary Examining 
Auttwrily (see Rulo 6l2-2(«). fifst eemenco). 

Consequence wHh rogard to tnaiatadioA of the tntematlonal applloaUon for entry Into ttie nationd phaeo 

The applicants attention Is drawn to the fad that, where upon entry into the nsttiond phase, a tran>la6csi of the 
delme aa amended under Artida 1 8 may hmm to be furnished to the designatedMected Offioae, indeed of« or 
in adffiion to, the tranatdion of the ddma » fried. 

For hJilher dddls on Ihe requirements of eadi destgnated/deded Office, sao Volume II of tho PCT Applioant's 
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EtsIT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 1 8 and Rules 43 and 44) 



Applicants or agents file reference 
18134/8X109 


POR PI IPTHFR see IMotitlcation of TransmlitaJ of IntemaUonal Search Report 
run rum i ncn PCT/tSA/220) ae well as, where applicable. Item 5 betow. 
ACTION 


International application No, 
PCT/NZ 00/00174 


International flUng date (day/ntonth/yoar) 

04/09/2000 


(Eariiast) Priority Date (day/monm/yeao 

02/09/1999 


Applicant 

NEW ZEALAND PASTORAL AGRICULTURE RESEARCH INSTI... 



This Internationa) Search Report has been prepared by this intemattonal Searching Authority and ie transmitted to the appRcam 
according to Article 18. A copy Is being transmlttod to the international Bureau. 

This International Search Report coneists of a total of 6 . sheets. 

[X] It is also accompanied by a copy of each prior art documenidted In this report. 



1 . Basi^ of the report 

a. With regard to the iangu^e, the Intemattonal search was earned out on the basis of the international application In the 
language in which It was filed, unless otherwise Indicated under Ihis Item. 

PI the international search was carried out on the basis of a translation of the Intemationat application furnished to this 



b. 



Authority {Rule 23.1 <b». 

With regard to any nucleoUde and/or amino acid sequence disclosed In the International application, the inlemational search 

was carried out on the t>asi3 of the sequence listing : 

fX] oontzdned In the international appllcaiton In written lorm. 

] I filed together with me international application in computer readable fomi. 

Q furnished subsequently to this Authority In written tonn. 

pn furnished subsequently to this Authority In computer readble fomo. 

PT) the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
^ International appUcafion as filed has been furnished. 

in the statement that the Infonnafion fecorded In computer readable lonn Is Identical to the written sequence listing has been 
furnished 



2. 
3. 



pn Certain claims were found imsearchable (See Box I). 
I I Unity of invention Is lacking (see Box II), 



4. With regard to the title, 

I I the text Is approved as submitted by the applicanL 
pn the text has been established by this Authority to read as follows: 
NUCLEOTIDES SEQUENCES ENCODING INSECTIDAI* PROTEIN COMPLEX FROlrt SERRATIA 



With regard to the abstract, 

pr[ the text is approved as submitted by the applicant 

□ the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
within one month from the date of mailing of this international search report, submit comments to this Authority. 



The figure of the drawingfi to be published with the abstract Is Figure No. 



I I as suggested by the applicanu H None of the figures. 

I I because the applicant failed to suggest a figure. 
I I because this figure better characterizes tt>e invenfion. 
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?0^^_OIO/00174^ 



onat Application No 



A. CLASSinCATION OF SUBJECT MATTEf^ . , , 

IPC 7 C12N15/31 C12N15/70 C12N15/82 C07K14/24 C12Q1/68 
A01N63/02 A01H5/00 

AccortSns 1o intefnattonal Patent GlaBSfficatkm <iPC) or to t)Oth national daasfficatton and IPC 



B. RELDS SEARCHED 



Minimum documentation neardidd (dassmcaiton aystem followed by classification symbols) 

IPC 7 C12N C07K AOIH C12Q 



Documentation seatched other than minimum documentation to thd fisdont that such Oocuments are Included in the fieldB seardied 



Electronic data base omsuHed during tho Intematlonal search (name of data base and. where pmctlcal. search terms used) 

BIOSIS, EPO-Internal , STRAND, WPI Data, PAJ 



C. DOCUMENTS CONSIDERED TO BE NELEVAMT 



Caiogofy ' CftaMon or document, wtth moicailon, wt>©rB appiopriafs, oftho relevant passages 



Relevant to claim Noi 



JACKSON T A ET AL: "PATHOGEN TO PRODUCT 
DEVELOPMENT OF SERRATIA-ENTOMOPHILA 
ENTEROBACTERIACEAE AS A COMMERCIAL 
BIOLOGICAL CONTROL AGENT FOR NEW ZEALAND 
GRASS GRUB COSTELYTRA-ZEALANDICA" 
JACKSON, T. A. AND T, R. GLARE (ED.). USE 
OF PATHOGENS IN SCARAB PEST, 

1992, pages 191-198, XP000997900 
0-946707-35-9. 1992 
the whole document 

-/— 



32 



Further docunwnts aro listed In the continuation of boc C. 



0 



Patent ramMy members are listed In annex. 



• Special categories of cbed documents : 

*A* documant denning the general state of the an vvhich is not 
considered to be of particular relevance 

'E' darUer document but pubitshedon or alter the imemalionat 
fumgdaie 

*L" document which may Vbucth doubts on pnortiy datm(3) or 
Which IS cBed to establish the publication date of another 
cRailon or other spedai reason (as specTied) 

"O" documen) referring to an oral disclosure, use. exhlbKlon or 
other mearts 

'P* document published prior to the international flHng date but 
later than the priority date claimed 



T* later doctiment publshed after the intfimaiionB) filing date 
or priori date and not In connici with the application but 
died to understand tne principle or theofy underlying the 
invention 

•X' document of particular relevance; the claimed invention 
cannot be ooneidered novel or cannot be considered to 
involve an invemive step when the document is taken alone 

•Y' document of panlculBr relevance; the clamed invention 

cannot be oonsiderad to involve an inventive step when the 
document te combined with one or mom other such docu- 
ments* such combtnallon being obvious to a peraon eldned 
in the art 

*&* document memt)er of the same patent family 



Date or the actual completion of the International eaarch 



23 May 2001 



Dale oT mailing of the International search report 



06/06/2001 



Name and nnalBng address of the ISA 

European Patent OtTloe, P. a 5818 PatOhtlaan 2 
NU - 2280 HV Rqswijk 
Tel. (-►31-.70) 340-2040, Tx- 31 6S1 ©po nl, 
Fax: (+31-70) 340-3016 



Authorized oincer 



Holtorf, S 



Foim PCT/ISAA10 (attcond 8he«t) (Juy 1892} 



page 1 of 3 
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C^ContlnuaUon) DOCUMElTTS CONStDERa^b BE RELEVANT 



#- 



lattenalApplieation No 

Pq|^^^0l0/00174_ 



Category* CllatkwoiaoaimenU with Imlicstion.wnArBBppropfiate. of the relevant passages 



Relevant to claim no. 



GRKOVIC STEVE ET AL: "Genes Essential for 

Amber Diseae in Grass Grubs Are Located on 

the Large Plasmid Found in Serratia 

entomophlla and Serratia proteamaculans." 

APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 

vol. 61, no. 6, 199&, pages 2218-2223, 

XP000994573 

ISSN: 0099-2240 

dted in the application 

the whole docufnent 

GLARE TRAVIS R ET AL: "Plasmid transfer 

among several members of the family 

Enterobacterlaceae increases the number of 

species capable of causing experimental 

amber disease in grass grub." 

FEMS MICROBIOLOGY LETTERS, 

vol. 139, no. 2-3, 1996, pages 117-120, 

XP000998482 

ISSN: 0378-1097 

cited in the application 

the whole document 

WO 99 42589 A (NOVARTIS ERFIND VERWALT 
GMBH ;N0VARTIS AG (CH); KRAMER VANCE CARY) 
26 August 1999 (1999-08-26) 
the whole document 

WO 98 08932 A (DOU AGROSCIENCES LLC 
;UISCONSIN ALUMNI RES FOUND (US)) 
5 March 1998 (1998-03-05) 
the whole document 

WO 98 08388 A (MORGAN JAMES ALUN WYNNE 
;JARRETT PAUL (6B); ELLIS DEBORAH JUNE 
(GB) 5 March 1998 (1998-03-05) 
the whole document 

WO 97 17432 A (WISCONSIN ALUMNI RES FOUND) 
15 May 1997 (1997-05-15) 
the whole document 

BOWEN D ET AL: "INSECTICIDAL TOXINS FROM 

THE BACTERIUM Photorhabdus Urolnescens" 

SCIENCE, AMERICAN ASSOCIATION FOR THE 

ADVANCEMENT OF SCIENCE,, US, 

vol. 280, 26 June 1998 (1998-06-26), pages 

2129-2132, XP0G21 15650 

ISSN: 0036-8075 

cited in the application 

-/— 



Form PCT/tSA/21 0 (contmufition or aeoood eheei) (July 1992) 
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Application No 

00/00174 



C^ContinuaUon) OOCUMENTS CONSIDEREtTTO BE RELEVANT 



Caiegory* 


Ctation ordocsument, with lndicatlon,wherB approp/iate, of the relavant paeaaoRB 


Relevant to ctafm No. 


A 


NUNEZ-VALDEZ M E ET AL: "The ainb2 locus 
from Serratla entomophila confers 
ant 1 -feeding effect on larvae of 
Costelytra zealandlca (Coleoptera: 
Scarabaeldae)" 

GENE, NL, ELSEVIER BIOMEDICAL PRESS. 

AMSTERDAM, 

vol. 172, no. 1, 

12 June 1996 (1996-06-12), pages 75-79, 

XP004042712 

ISSN: 0378-1119 

cited In the application 




P.X 


HURST MARK R H ET AL: "Plasmld-located 
pathogenicity determinants of Serratia 
entomophila, the causal agent of amber 
disease of grass grub, show similarity to 
the Insectlcldal toxins of Photorhabdus 
lumlnescens." 
JOURNAL OF BACTERIOLOGY, 
f vol. 182, no. 18, 

September 2000 (2000-09), pages 5127-5138, 
XP002166799 
ISSN: 0021-9193 
the whole document 


1-4, 
9-16, 
21-27, 
31.41 



Focm PCTyiSA/2'lO (cvniniMtien of saoond ihem) (July 1982) 
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international Application No. PCTMZ 00 ;00174 



FURTHER INFORMATION CONTINUED FROM PCTilSA/ 210 



Continuation of Box 1.2 
Claims Nos. : 17 

Present clalni 17 relates to a llgand defined by reference to a desirable 
characteristic or property, namely binding to the polypeptide of claim 
15; 



The claim covers all products having this characteristic or property, 
whereas the application provides no support within the meaning of Article 
6 PCT and/or disclosure within the meaning of Article 5 PCT for such 
products . 

In the present case, the claims so lack support, and the application so 
lacks disclosure, that a meaningful search over the whole of the claimed 
scope is impossible. Independent of the above reasoning, the claims also 
lacjc clarity (Article 6 PCT). An attempt is made to define the product by 
reference to a result to be achieved. 

Again, this lack of clarity in the present case is such as to render a 
meaningful search over the whole of the claimed scope impossible. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
Is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched- This Is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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INTERNATiO! 



I^^EARCH REPOFtT ^ 



.^tlonal appllcadon No. 
IPCT/NZ 00/00174 



Box I Observations whm certain claims wer found unsearchable (Cootinuation of item 1 of first sham) 



ThiB mtemational Search Report has not been established in respect of certain claims under Article 17(2)(a) for the foliowing reasons: 
1. l_\ Claims Nos.: 

^""^ because they relate to subject matter not required to t>e searched by this Auttiortty. namely: 



Claims Nos.: 17 

because they relate to pans of the international Application that do not comply with the prescnt>ed requirements to such 
an exiant that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. n CtalmaNos.: 

— because they are dependent claims and are not drafted in aocordanoe wHh the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of inventian Is lacMng (Continuation of item 2 of first sheet) 



This InteniatlonaJ Searching Authortty tound multiple inventions In this Intemattonal application, as follows: 



1 . I I AS all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' — * searchable dalms, 

2, \^ As alt searchabia claims could be searched without effort justifying an addittonat lee, this Authority did not Invlie payment 

of any additional 'fee. 



3. J I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
I— I covers only those claims for which fees were paid, specifically claims Noe.: 



4. [_| No repuiredadditiormJ search fees were timejy paid by the applicant Consequently, mis Inter^^ 
restricted to the invention first mentioned in the claims; it is covered by claims Noe.: 



Remark on f>rotest [ [ The ad<fitienaJ search fees were accompanied by the applicanrs protest, 

I j No protest accompanied the payment of additional search fees. 



f=orm PCT/lSA/210 (continuation of first sheet (1)) (July 1998) 
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INT^ ATIONAL SEARCH REPORT 

Inf orn^^^ cm patent family membore 


• ' 1 


iiitinii^Si Hint ADOllcatlon No 

pdH^ 00/00174 


Patent document 
clfod In search report 


Puk)licaik>n 
data 


Patem family 
member{8) 


PuWicattOn 
date 



UO 9942589 



26-08-1999 



AU 
EP 



3028699 A 
1054972 A 



06-09-1999 
29-11-2000 



WO 9808932 



05-03-1998 



AU 
AU 
AU 
BR 
BR 
CA 
EP 
EP 
HU 



729228 B 
1050997 A 
2829997 
9606889 
9711441 
2209659 
0797659 
0970185 



A 
A 
A 
A 
A 
A 



9900768 A 



JP 2000515024 



PL 
PL 
SK 
SK 
TR 
UO 



321212 
332033 
24699 
93197 
9901126 
9717432 



25-01- 
29-05- 
19-03- 
28-10- 
24-10- 

15- 05- 
01-10- 
12-01- 
28-06- 

14- 11- 
24-11- 

16- 08" 
10-04- 
06-05- 
21-07- 
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The present invention concerns novel nucleotide sequences encoding insecticidal proteins 
from the Enterobacteriaceae, Serratia entomophila and Serratia proteamaculans, and the 
5 use of said nucleotide sequences and insecticidal proteins. 

BACKGROUND Art 

Some Serratia entomophila and Serratia proteamaculans strains in New Zealand are 
knov^n to cause a disease in the major scarab pest, Costelytra zealandica (New Zealand 
grass grub). The disease was first discovered and described by Trought and Jackson (1982) 
10 and was later named amber disease after the distinctive colour of affected insects (Stucki et 
al. 1984). One species capable of causing the disease, Serratia entomophila, was 
developed into a conunercially-available product ("Invade") in 1989. 

The disease is highly host specific, only know to infect a single indigenous species of New 
Zealand scarab larva. The disease appears unique among insects and results not from rapid 

15 invasion of the haemocoel, but from a slow colonisation of the gut. The disease has a 
distinct phenotypic progression, with infected hosts ceasing feeding within 2-5 days of 
ingesting pathogenic cells. The normally black gut clears around this time (Jackson et al. 
1993) and the levels of the major gut digestive enzymes (trypsin and so forth) decreases 
sharply (Jackson, 1995). The clearance of the gut results in a characteristic amber colour of 

20 the infected hosts. The larvae may remain in this state for a prolonged period (1-3 months) 
before bacteria eventually invade the haemocoel, causing rapid death. 

The finding of a plasmid that apparently encoded the disease was reported in Glare et al. 
(1993) by showing a correlation between pADAP presence and disease occurrence in 
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bacterial strains. This was further confirmed by Glare et al. (1996) who showed that 
transfer of the plasmid from pathogenic to non-pathogenic strains resulted in a change to 
pathogenic. 

Grkovic et al. (1995) showed that disruption of the plasmid by transposon insertion could 
5 alter pathogenicity without fully defining the area containing the gene cassette. By marker 
exchange, they showed that a lO.Skb HindUl (pGLA20) construct from pADAP encoded 
some functions of amber disease. However, the clone did not contain all disease encoding 
plasmid-bome regions. 

Another region involved in amber disease encoding was located by Nunez- Valdez and 
10 Mahanty (1996). They located a locus, amb2, by transposon mutagensis and searching a 
cosmid genomic library. This region was chromosomally located and was involved in 
antifeeding in the larvae of Costelytra zealandica. However, the current applicant's 
research has demonstrated that the amb2 region is located on pADAP remote from the 
virulence gene and is probably regulatory in function. 

15 Insecticidal toxins which share some protein homology to the Serratia insecticidal proteins 
of the present invention have been recently discovered (PCTAJS96/ 18803; 
PCT/US97/07657) by a group at Wisconsin University (Blackburn et al. 1998; Bowen et al. 
1998; Bowen and Ensign 1998). These insecticidal toxins are produced from a gene region 
in Photorhabdus luminescens which resembles the Serratia virulence region in the 

20 clustering of the genes and at the protein level, but has very little DNA homology with the 
Serratia genes. They have shown high molecular weight proteins from Photorhabdus 
luminescens are insecticidal to a number of insects from different orders. The lack of DNA 
homology over the majority of the region, as opposed to protein homology, between the 
Serratia genes and Photorhabdus genes suggests that these proteins have evolved as a 

25 result of convergent evolution leading to the formation of a distinct protein family with a 
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The present applicant has now found that three regions of the pADAP plasmid are required 
for full insecticidal function. Sequence analysis of these three regions has shown that the 
present applicant has isolated and identified a novel toxin from Serraiia species that 
belongs to a new family of insecticidal toxins. It is broadly to this toxin that the present 
invention is directed. 

DISCLOSURE OF INVENTION 

According to a first aspect of the present invention, there is provided an isolated nucleic 
acid molecule comprising a nucleotide sequence of SEQ ID NO: 1 which encodes an 
insecticidal protein complex, or a functional fragment, neutral mutation, or homolog 
thereof which have at least 75% nucleic acid homology to SEQ ID NO: 1 and are capable 
of hybridising with said nucleic acid molecule under stringent hybridisation conditions. 

The invention also provides an isolated nucletc acid molecule comprising the nucleotide 
sequence 1995-18937 of SEQ ID NO: 1 which encodes an insecticidal protein complex, or 
a functional fragment, neutral mutation, or homolog thereof capable of hybridising with 
said nucleic acid molecule under standard hybridisation conditions. 

The invention also provides an isolated nucleic acid molecule comprising one or more of 
the nucleotide sequences 2411-9547, 9589-13883 or 14546-17467 of SEQ ED NO: 1 which 
encode insecticidal proteins, or a functional fragment, neutral mutation, or homolog thereof 
capable of hybridising with said nucleic acid molecule under standard hybridisation 
conditions. 

Preferably the nucleic acid molecule comprises all of nucleotide sequences 2411-9547, 
9598- 1 3884 and 14546- 1 7467 of SEQ ID NO; 1 . 
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The invention further relates to an isolated nucleic acid molecule comprising a sequence of 
SEQ ID NO: 1, nucleotides 1955-18937 of SEQ ID NO: I or one or more of nucleotides 
241 1-9547, 9598-138S4 or 14546-17467 of SEQ ID NO: 1, operably linked to at least one 

m 

further nucleotide sequence which encode an insecticidal protein. For example, the at leasi 
5 one further nucleotide sequence may be the nucleotide sequence which codes for the 
Bacillus delta endo toxins, vegatative insecticidal proteins (vips), cholesterol oxidases, 
Clostridium bifermentens mosquitocidal toxins and/or Photorhabadus luminescens toxins 
and so forth. 

The nucleic acid molecule may comprise DNA, cDNA or RNA. 

10 Preferably said fragment, neutral mutation or homolog thereof is capable of hybridising to 
said nucleic acid molecule under stringent hybridisation conditions. 

The invention further relates to nucleic acid molecules which hybridise to the nucleotide 
sequence of SEQ ID NO: h or nucleotides 1955-18937, 241 1-9547, 9598-13884 or 14546- 
17467 of SEQ ED NO: 1 if there is at least 75% or greater identity between the sequences. 

15 The nucleic arid molecule may be isolated from Serratia entomophila or Serratia 



proteanuiculans strains. 

Also provided by the present invention are recombinant expression vectors containing the 
nucleic acid molecule of the invention and hosts transformed with the vector of the 
invention capable of expressing a polypeptide of the invention. 

20 The vector may be selected from any suitable natural or artificial plasmid/vector. For 
example, pUC 19 (Yannish-Perron et al. 1995), pProEX HT (GibcoBRL, Gaithersburg, 
MD, USA), pBR322 (Bolivar et al. 1977), pACYCl84 (Chang et al. 1978), pLAFR3 
(Staskowicz et al. 1987), and so forth. 
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In a further aspect, the invention provides a method of producing a polypeptide of the 
invention comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected with a vector as 
defined above to express the encoded polypeptide or peptide; and 

5 (b) recovering the expressed polypeptide or peptide. 

An additional aspect of the present invention provides a ligand that binds to a polypeptide 
of the invention. Most usually, the ligand is an antibody or antibody binding fragment. 
Such ligands also form a part of this invention. 

According to a further aspect of the present invention there are provided probes and primers 
10 comprising a fragment of the nucleic acid molecule of the invention capable of hybridising 
under stringent conditions to a native insecticidal gene sequence. Such probes and primers 
are useful, for example, in studying the structure and function of this novel gene and for 
obtaining homologs of the gene from bacteria other than Serratia sp. 

According to a still further aspect of the present invention there is provided a polypeptide 
15 having insecticidal activity encoded by the nucleic acid molecule of the invention, or a 
functional fragment, neutral mutation or homolog thereof. 

The polypeptide may comprise the amino acid sequence of SEQ ID NO: 1 or a functional 
fragment, neutral mutation or homolog thereof. 

The polypeptide may comprise amino acids 32-51 18 of SEQ ID NO: 1. 

20 The polypeptide may comprise at least one amino acid sequence of SEQ ID NO: 2; SEQ ID 
NO: 3; SEQ ID NO: 4; SEQ ID NO: 5 or SEQ ID NO: 6. 

Preferably the polypeptide comprises amino acid sequence SEQ ID NO: 4; SEQ ID NO: 5 
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More preferably the polypeptide comprises all of SEQ ID NOs: 2-6, 

Conveniently, the polypeptide of the invention is obtained by expression of a DNA 
sequence coding therefore in a host cell or organism. 

5 The polypeptide may comprise the amino acid sequence of SEQ ID NO: 1 linked to at least 
one further amino acid sequence encoding an insecticidal protein. For example, the at least 
one further amino acid sequence may be the amino acid sequence which codes for Bacillus 
delta endo toxins, vegatative insecticidal proteins (vips), cholesterol oxidases, Clostridium 
bifermentens mosquitocidal toxins and/or Photorhabadus luminescents toxins etc. 

10 The invention further relates to polypeptides comprising at least 50%, preferably 60%, 
more preferably 70% and most preferably 90-95% or greater identity to SEQ ID NO: 1. 

The polypeptide may be produced by expression of a vector comprising the nucleic acid 
molecule of the invention or a functional fragment, neutral mutation or homolog thereof, in 
a suitable host cell. 

15 According to a further aspect, there is provided an insecticidal composition comprising at 
least the polypeptide of the invention and an agriculturally acceptable carrier such as would 
be known to a person skilled in the art. More than one polypeptide of the invention can of 
course, be included in the composition. In addition, the composition may comprise one or 
more additional pesticides, for example, compounds known to possess herbicidal, 

20 fungicidal, insecticidal or nematicidal activity. 

The composition may further comprise other known insecticidally active agents, such as 
Bacillus delta endo toxins, vegatative insecticidal proteins (vips), cholesterol oxidases, 
Clostridium bifermentens mosquitocidal toxins and/or Photorhabadus luminescents toxins 
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and SO forth. 

According to a further aspect, there is provided a method of combating pests, especially 
insects at a locus or host for the pest infested with or liable to be infested therewith, said 
method comprising applying to a locus, host and/or the pest, an effective amount of the 
5 polypeptide of the invention that has functional insecticidal activity against said pest. 

According to a further aspect the invention provides a method of inducing amber disease or 
like condition in insects comprising delivery to an insect an effective amount of the 
polypeptide of the invention that has functional insecticidal activity against said insect. 

The insect may be selected from the order comprising Coleoptera (such as the black beetle, 
10 Heteronychus arator (F.), or the black vine weevil, Otiorhynchus sulcatus (F.)); 
Dictyoptera teg. The German cockroach, Blattella germanica (L.), or the subterranean 
termite Coptotermes spp,); Diptera (eg. the housefly Musca domestica L. or the blowfly 
Lucillia cuprina (Wiedermann); Orthoptera (eg. The black field cricket Telleogryllus 
commodus (Walker) or the migratory locust Locusta migratoria L.); Hymenoptera (eg. The 
15 German wasp, Vespula germanica F.)); Hemiptera (such as the green vegetable bug Nezara 
viridula (L.) or the green peach aphid Myzus persicae (Sulzer)) the Lepidoptera (eg. the 
tomato fruitworm, Helicoverpa armigera (Walker), or the codling moth, Laspeyresia 
pomonella (L.)). 

The insecticidal polypeptide may be delivered to the insect orally either as a solid bait 
20 matrix, as a sprayable insecticide sprayed onto a substrate upon which the insect feeds, 
applied directly to the soil subsurface or as a drench or is expressed in an transgenic plant, 
bacterium, virus or fungus upon which the insect feeds, or by any other suitable method 
which would be obvious to a person skilled in the art. 

According to a further aspect, the invention provides a transgenic plant, bacterium vims or 



7 



wo 01/16305 




PCT/NZOO/00174 



fungus, incorporating in its genome, a nucleic acid molecule of the invention providing the 
plant, bacterium virus or fungus with an ability to express an effective amount of an 
insecticidal polypeptide. 

DEFINITIONS AND METHODS 

5 The foUov/ing definitions and methods are provided to better define the present invention 
and to guide those of ordinary skill in the art in the practice of the present invention. 

Definitions of common terms in molecular biology may also be found in Lewin, Genes V, 
Oxford University Press: New York, 1994. 

The term "native" refers to a naturally-occurring nucleic acid or polypeptide, including, 
10 wild-type sequence and alleles thereof. 

A "homolog" has at least one of the biological activities of the nucleic acid or polypeptide 
of the invention and comprises at least 50-70% identical amino acid or nucleic acid 
sequence thereto, preferably 75-85% and most preferably 90-95% identical amino acid or 
nucleic acid sequence thereto. 

15 The term "neutral mutation" means a mutation, (that is - a change in the nucleotide or 
polypeptide sequence such as by deletion, substitution, inversion or insertion, any of which 
have no effect on the function of the encoded protein). 

As indicated above, also possible are variants of the polypeptide or peptide that differ from 
the native amino acid sequence by insertion, substitution or deletion of one or more amino 
20 acids. Where such a variant is desired, the nucleotide sequence of the native DNA is 
altered appropriately. This alteration can be made through elective synthesis of the DNA. 
or by modification of the native DNA by, for example, site specific or cassette mutagenesis. 
Preferably, where portions of cDNA or genomic DNA require sequence modifications, site- 
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Specific primer directed mutagenesis is employed using techniques standard in the art. 

In a further aspect, the present invention consists in replicable transfer vector suitable for 
use in preparing a polypeptide of the invention. These vectors may be constructed 
according to techniques well known in the art, or may be selected from cloning vecotrs 
5 available in the art. 

The cloning vector may be selected according to the host or host cell to be used. Useful 
vectors will generally have the following characteristics: 

(a) the ability to self-replicate; 

(b) the possession of a single target for any particular restriction endonuclease; and 

10 (c) desirably, carry genes for a readily selectable marker such as antibiotic resistance. 

Two major types of vector possessing these characteristics are plasmids and bacterial 
viruses (bacteriophages or phages). Presently preferred vectors include plasmids pMOS- 
Blue, pGem-T and pUC8. 

The nucleic acids of the present invention can be free in solution, or attached by 
15 conventional means to a solid support, or present in an expression vector or any other type 
of plasmid. ^ 

The term "isolated" means substantially separated or purified away from contaminating 
sequences in the cell or organism in which the nucleic acid naturally occurs and includes 
nucleic acids purified by standard purification techniques as well as nucleic acids prepared 
20 by recombinant technology and those chemically synthesised. 

The terms "DNA construct" means a construct incorporating the nucleic acid molecule of 
the present invention, or a fractional fragment, neutral mutation or homolog thereof in a 
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position whereby the protein coding sequence is under the control of an operably linked 
promoter capable of expression in a plant cell. Such promoters are well known in the art. 

A fragment of a nucleic acid molecule according to the present invention is a portion of the 
nucleic acid that is less than full length and comprises at least a minimum length capable of 
5 hybridising specifically with a nucleic acid molecule according to the present invention (or 
a sequence complementary thereto) under stringent conditions as defined below. A 
fragment according to the present invention has at least one of the biological activities of 
the nucleic acid or polypeptide of the present invention. 

Nucleic acid probes and primers can be prepared based on nucleic acids according to the 
10 present invention (for example, the sequence of SEQ ID NO: 1). A "probe" comprises an 
isolated nucleic acid attached to a detectable label or reporter molecule well known in the 
art. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and 
enzymes. 

"Primers" are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more 
15 in length, which are annealed to a complementary target DNA strand by nucleic acid 
hybridisation to form a hybrid between the primer and the target DNA strand, then 
extended along the target DNA strand by a polymerase, preferably a DNA polymerase. 
Primer pairs can be used for amplification of a nucleic acid sequence, (for example, by the 
polymerase chain reaction (PGR) or other nucleic acid amplification methods well known 
20 in the art). PCT-primer pairs can be derived from the sequence of a nucleic acid according 
to the present invention, (for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5© 1991, Whitehead Institute for Biomedical Research, 
Cambridge, MA)). 

Methods for preparing and using probes and primers are described, for example, in 
25 Sambrook et al. Molecular Cloning: A Laboratory Manual, 2"^ ed. vol. 1-3, ed Sambrook 
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et al. Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY, 1989. 

Probes or primers can be free in solution or covalcntly or noncovalendy attached to a solid 
support by standard means. 

The term "operably linked" means a first nucleic acid sequence linked to a second nucleic 
5 acid sequence when the first nucleic acid sequence is placed in a functional relationship 
with the second nucleic acid sequence. For instance, a promoter is operably linked to a 
coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences are contiguous and, where necessary 
to join two protein coding regions, in reading frame. 

10 The DNA molecules of the invention may be expressed by placing them in operable linkage 
with suitabfe control sequences in a replicable expression vector. Control sequences may 
include origins of replication, a promoter, enhancer and transcriptional terminator 
sequences, amongst others. The selection of the control sequence to be included in the 
expression vector is dependent on the type of host or host cell intended to be used for 

15 expressing the DNA. 

A "recombinant" nucleic acid is one that has a sequence that is not naturally occurring or 
has a sequence that is made by an artificial combination of two otherwise separated 
segments of sequence. This artificial combination is often accomplished by chemical 
synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic 
20 acids (for example, by genetic engineering techniques). 

Techniques for nucleic acid manipulation are described generally in, for example, 
Sambrook etal. (1989). 

Large amounts of a nucleic acid according to the present invention can be produced by 
recombinant means well known in the art or by chemical synthesis. 
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Natural or synthetic nucleic acids according to the present invention can be incorporated 
into reconnbinant nucleic acid constructs, typically DNA constructs, capable of introduction 
into and replication in a host cell. Usually the DNA constructs will be suitable for 
replication in a unicellular host, such as E. coli or other commonly used bacteria, but can 
5 also be introduced into yeast, mammalian, plant or other eukaryotic cells. 

Preferably, such a nucleic acid construct is a vector comprising a replication system 
recognised by the host For the practice of the present invention, well known compositions 
and techniques for preparing and using vectors, host cells, introduction of vectors into host 
cells and so forth., are employed, as discussed, inter alia, in Sambrook et al (1989). 

10 A cell, tissue, organ, or organism into which has been introduced a foreign nucleic acid, 
vSuch as a recombinant vector, is considered "transformed" or "transgenic". The DNA 
construct comprising a DNA sequence according to the present invention that is present in 
a transgenic host cell, particularly a transgenic plant, is referred to as a "transgene". The 
term "transgenic" or "transformed" when referring to a cell or organism, also includes; 

15 ( 1 ) progeny of the cell or organism, and 

(2) plants produced from a breeding program employing such a "transgenic" plant as a 
parent in a cross and exhibiting an altered phenotype resulting from the presence of 
the recombinant DNA construct. 

Generally, procaryotic, yeast, insect, or mammalian cells are useful hosts. Also included 
20 within the term hosts are plasmid vectors. Suitable procaryotic hosts include E. coli. 
Bacillus species and various species of Pseudomonas. Commonly used promoters such as 
p-lactamase (penicillinase) and lactose (lac) promoter systems are all well known in the art. 
Any available promoter system compatible with the host of choice can be used. Vectors 
used in yeast are also available and well known. A suitable example is the 2 micron origin 
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of replication plasmid. 

Similarly, vectors for use in mammalian cells are also well known. Such vectors include 
well known derivatives of SV-40, adenovirus, retrovirus-derived DNA sequences, Herpes 
simplex virus, and vectors derived from a combination of plasmid and phage DNA. 

5 Further eucaryotic expression vectors are known in the art (for example in P.J. Southern 
and P. Berg, 7. Mol AppL Genet. 1 327-341 (1982); S. Subramani et al., Mol Cell Biol, 1, 
854-864 (1981); RJ. Kaufmann and P. A. Sharp, ''Amplification and Expression of 
Sequences Cotransfected with a Modular Dihydrofolate Reducase Complementary DNA 
Gene, 7. Mol Biol, 159, 601-621 (1982); R.J. Kaufmann and P.A. Sharp, Mol. Cell, Biol 
10 159, 601-664 (1982); S.L Scahill et al., ''Expressions and Characterisation of the Product 
of a Human Immune Interferon DNA Gene in Chinese Hamster Ovary Cells" Proc. Natl 
Acad. Sci. USA. 80, 4654-4659 (1983); G. Urlaub and L.A. Chasin. Proc. Natl Acad. ScL 
USA. 77, 4216-4220, (1980). 

The expression vectors useful in the present invention contain at least one expression 
15 control sequence that is operatively linked to the DNA sequence or fragment to be 
expressed. The control sequence is inserted in the vector in order to control and to regulate 
the expression of the cloned DNA sequence. Examples of useful expression control 
sequences are the lac system, the trp system, the tac system, the trc system, major operator 
and promoter regions of phage lambda, the glycolytic promoters of yeast acid phosphatase, 
20 (for example, Pho5), the promoters of the yeast alpha-mating factors, and promoters 
derived fron\ polyoma, adenovirus, retrovirus, and simian virus (for example, the early and 
late promoters of SV-40), and other sequences known to control the expression of genes of 
prokaryotic and eucaryotic cells and their viruses or combinations thereof. 

In the construction of a vector it is also an advantage to be able to distinguish the vector 
25 incorporating the foreign DNA from unmodified vectors by a convenient and rapid assay. 
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Reporter systems useful in such assays include reported genes, and other detectable labels 
which produce measurable colour changes, antibiotic resistance and the like. In one 
preferred vector, the P-galactosidase reporter gene is used, which gene is detectable by 
clones exhibiting a blue phenotype on X-gal plates. This facilitates selection. In one 
5 embodiment, the p-galactosidase gene may be replaced by a polyhedrin-encoding gene; 
which gene is detectable by clones exhibiting a white phenotype when stained with X-gal. 

This blue-white colour selection can serve as a useful marker for detecting recombinant 
vectors. 

Once selected, the vectors may be isolated from the culture using routine procedures such 
10 as freeze-thaw extraction followed by purification. 

For expression, vectors containing the DNA of the invention to be expressed and control 
signals are inserted or transformed into a host or host cell. Some useful expression host 
cells include well-known prokaryotic and eucaryotic cells. Some suitable prokaryotic hosts 
include, for example, E. coli, such as E. colU S G-936, E. coli HB 101, K coli W3110, E. 
15 coli XI 776. E, coli, X2282, £. coli DHT and £. coli MROl, Pseudomonas, BacilluS:, such 
as Bacillus subtilis and Streptomyces. Suitable eucaryotic cells include yeast and other 
fungi, insect, animal cells, such as COS cells and CHO cells, human cells and plant cells in 
tissue culture. 

Depending on the host used, transformation is performed according to standard techniques 
20 appropriate to such cells. For prokaryotes or other cells that contain substantia] cell walls, 
the calcium treatment process (Cohen, S N Proceedings, National Academy of Science, 
USA 69 21 10 (1972)) may be employed. For mammalian cells without such cell walls the 
calcium phosphate precipitation method of Graeme and Van Der Eb, Virology 52:546 
(1978) is preferred. Transformations into plants may be carried out using Agrobacterium 
25 tumefaciens (Shaw et al., Gene 23:3 15 (1983)) or into yeast according to the method of Van 
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Solingen et al. 7. BacU 130:946 (1977) and Hsiao et al. Proceedings, National Academy of 
Science, 76:3829(1979). 

Upon transformation of the selected host with an appropriate vector the polypeptide, or 
peptide encoded can be produced, often in the form of fusion protein, by culturing the host 
5 cells. The polypeptide, or peptide, of the invention may be detected by rapid assays as 
indicated above. The polypeptide, or peptide, is then recovered and purified as necessary. 
Recovery and purification can be achieved using any of those procedures known in the art, 
for example by absorption onto the elution from an anion exchange resin. This method of 
producing a polypeptide, or peptide, of the invention constitutes a further aspect of the 
10 present invention. 

Host cells transformed with the vectors of the invention also form a further aspect of the 
present invention. 

Methods for chemical synthesis of nucleic acids are well known and can be performed, for 
example, on commercial automated oligonucleotide synthesisers. 

15 The term "stringent conditions" is functionally defined with regard to the hybridisation of a 
nucleic acid probe to a target nucleic acid (for example, to a particular nucleic' acid 
sequence of interest) by the hybridisation procedure discussed in Sambrook et al. (1989) at 
9.52-9.55 and 9.56-9.58. 

Regarding the amplification of a target nucleic acid sequence (for example,, by PGR) using 
20 a particular amplification primer pair, stringent conditions are conditions that permit the 
primer pair to hybridise only to the target nucleic acid sequence to which a primer having 
the corresponding wild type sequence (or its complement) would bind. 

Nucleic acid hybridisation is affected by such conditions as salt concentration, temperature, 
or organic solvents, in addition to the base composition, length of the complementary 
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Strands, and the number of nucleotide base mismatches between the hybridising nucleic 
acids, as will^be readily appreciated by those skilled in the art. 

When referring to a probe or primer, the term "specific for (a target sequence)" indicates 
that the probe or primer hybridises under stringent conditions only to the target sequence in 
5 a given sample comprising the target sequence. 

The term "protein (or polypeptide)" refers to a protein encoded by the nucleic acid 
molecule of the invention including fragments, mutations and homologs having the same 
biological activity (for example, insecticidal activity). The polypeptide of the invention can 
be isolated from a natural source, produced by the expression of a recombinant nucleic acid 
10 molecule or be chemically synthesised. 

Peptides having substantial sequence identity to the above-mentioned peptides can also be 
employed in preferred embodiments. Here, "substantial sequence identity" means that two 
peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT 
using default gap weights, share at least 80% sequence identity, preferably at least 90% 
15 sequence identity, more preferably at least 95% sequence identity or more. Preferably, 
residue positions that are not identical differ by conservative amino acid substitutions. For 
example, the substitution of amino acids having similar chemical properties such as charge 
or polarity are not likely to effect the properties of a protein. Examples include glutamine 
for asparagine, or glutamic acid for aspartic acid. 

20 Brief Description of drawings 



The invention will be further defined by reference to the specification and the following 
examples and figures herein. 



Figure 1 



shows restriction maps of clones used to isolate the pathogenic region and 
maps of the two pathogenic variants pMH32 and pMH41, in accordance 
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with a preferred embodiment of the present invention; and 

Figure 2 shows deletion derivatives used in the study, restriction maps of the mutated 
constructs and recombinants, the phenotype of each mutation, the schematic 
diagram of the sequenced region, and a nucleotide sequence in accordance 
5 ^ with a preferred embodiment of the present invention; and 

Figure 3 shows hydrophobicity plots of SepC and its closest homologue TccC, in 
accordance with a preferred embodiment of the present invention; and 

Figure 4 shows the comparison of protein sequences of the SepA and P. luminescens 
toxins, TcdA, TcaB and TccB Putative RGD motif is boxed, plus the site of 
10 proteolytic cleavage is illustrated, in accordance with a preferred 

embodiment of the present invention; and 

Fi gure 5 shows the comparison of protein sequences of the SepC and P. luminescens 
toxin TccC, in accordance with a preferred embodiment of the present 
^ invention; and 

15 Figure 6 shows the plasmid pADAP, in accordance with a preferred embodiment of 
the present invention. 

Best Modes for Carrying out the Invention 

The invention will be further defined by reference to the specification and the following 
examples and figures herein in the ensuing description by way of example only where: 

20 Figure 1 shows restriction maps of clones used to isolate the pathogenic region and maps of 
the two pathogenic variants pMH32 and pMH41, where: 

(A) Is the pi^DAP HindUl clone pGLA-20 showing locations of the pGLA-20 mutations - 
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10,-13, and 35, which when recombined back into pADAP and bioassayed against grass 
grub, result in either a pathogenic phenotype, shown by full flag, or a healthy but non- 
feeding phenotype indicated by half filled flag. Map of pBG3S showing relative position of 
pGLA-20-3S mutation and the location of the 2.2kb EcdRx used as a probe to screen the 
5 pADAP J5amHI library; and 

(B) Illustrated restriction enzyme maps of the pathogenic clones pMH32 and pMH41, area 
of deletion is indicated by A. 

^ p9.R3^22 vector DNA; 



10 Restriction enzymes are abbreviated as follows: B, BamHU Bg, BglQ^ B, EcoRL H, Hind[E\ 
andXXfeal. 

Figure 2 shows: 

(A) Which are Mini-TniO pACYC184 based deletion derivatives used in the study. 
Omn is the pAC YC 1 84 vector. 

15 A indicates deletion + pathogenic, 
loss of pathogenicity; and 

(B) Dlustrates restriction maps of the mutated constructs pBM32 and the pADK 
recombinants; and 

(C) Where the phenotype of each mutant is indicated by flags. 

20 Blocked flags indicates mutations that did not affect the disease process. 



pLAFR3 vector DNA. 



Open flags indicate mutations that abolish disease symptoms. 
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Half-filled flags denote mutations that abolish visual disease symptoms but are 
unable to feed. 

* indicates pADK mutations obtained by Grkovic et al. (1995). 

Restriction enzymes are abbreviated as follows: B, BamHL, Bg, BglH; E, EcoRI; H, 
5 Hindm\ and X, XbaL 

(D) Is a schematic diagram of the sequenced region, where: 

Ml Denotes sequenced region. 

Arrows indicate ORFs and their direction 

BBSS region homologous to spvB ... location of repeat. 

10 (E) Is a nucleotide sequence of the 5 times 12bp repeat and the palindrome. 

Restriction enzymes are abbreviated as follows: B, BamUL, Bg, BgHl; B, EcoHl', H, HindnL\ 
and X,Xfea/. 

In Figure 3 hydrophobicity plots of SepC and its closest homologue TccC are shown. The 
scale is disproportional to size and has a scanning window of 17 amino-acid residues. 

15 Figure 4 shows the comparison of protein sequences of the SepA and P. luminescens 
toxins, TcdA, TcaB and TccB. Putative RGD motif is boxed. The site of proteolytic 
cleavage is reported by Bowen et al. (1998) (Residue 1933 of TcdA) is indicated by an 
arrow. 

Figure 5 shows the comparison of protein sequences of the SepC and P. luminescens toxin 
20 TccC; and Figure 6 shows the plasmid pADAP. 
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Protocol 

Bacterial isolates and methods of culture 

Table 1 lists bacterial isolates and plasmids used in the present invention. Bacteria were 
grown in LB*broth or on LB agar (Sambrook et al. 1989), at 37** for Escherichia coli and 
5 SO^'C for 5. entomophila. Antibiotic concentrations used (^g/ml) for Serratia were 
kanamycin 100, chloramphenicol 90, tetracycline 30 and for E. coli strains were kanamycin 
50, chloramphenicol 30, tetracycline 15, and ampicillin 100. 

DNA isolation and manipulations 

pADAP DNA was isolated from a 50ml overnight culture of bacteria using QIAGEN® 
10 plasmid maxi kit (Qiagen, Hilden, Germany), as per the manufacturer's instmctions. 
Standard DNA techniques were carried out as described by Sambrook et al. (1989). 
Radioactive probes were made using the Amersham Megaprime DNA labeling system 
(Amersham, Buckinghamshire, UK). Southem and colony hybridisations were performed 
as outlined in Sambrook et al. (1989). The plasmid pADAP is shown in Figure 6. 

15 pADAP SamHI library was constructed using a Sigma 'Gigapack®IIIXL packaging extract, 
as specified by the manufacturer (Stratagene, California, USA). 

Introduction of plasmid DNA into E. coli and S. entomophiUa 

pLAFR3 based derivatives were introduced into S. entomophiUa by tripartite matings on 
solid media as described previously (Finnegan & Sheratt, 1982) using the pRK2013 helper 
20 plasmid (Figorski & Helanski, 1979). pACYC184 and pBR322 based plasmids were 
electroporated into E. coli and 5. entomophiUa strains, using a Biorad Gene Pulser (2|iF, 
2.5KV, and 200 abns) (Dower et al. 1988). 
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Mutagenesis 

Transposon insertions were generated in recombinant plasmids using the mini-rniO 
derivative 103 (kanamycin resistant) as described by Kleckner et al. (1991). Insertions 
were recombined into pADAP by transforming A1M02 (refer to Table 1) with the 
5 described construct. After growth in non-selective media, bacteria were screened for 
resistance to kanamycin and loss of the pLAFR3 tetracycline resistance marker. 

Bioassay agdinst Costelytra zealandica larvae 

Infection of C. zealandica larvae was determined by a standard bioassay where the healthy 
larvae, collected from the field, were individually fed squares of carrot which had been 

10 rolled in colonies of bacteria grown overnight on solid media (resulting in approximately 
10^ cells/carrot square). Twelve, second or third instar larvae were used for each treatment. 
Inoculated larvae were maintained at 15°C, in ice-cube trays. Larvae were left feeding on 
treated carrot for 3-4 days, theh transferred to fresh trays and provided with untreated carrot 
for 1 0-14 days. The occurrence of gut clearance and loss of feeding was recorded every 3-4 

15 days. Strains were considered disease-causing if greater than 70% of larvae showed disease 
symptoms by day 14. Known pathogenic and non pathogenic controls were included in all 
bioassays. Typically cessation of feeding occurs within 2-3 days while clearance of the 
larvae gut may take 4-6 days. 

Recovery of bacteria from larvae 

20 To isolate bacteria from inoculated grubs, larvae were surface sterilised by submerging in 
70% methanol for 30 seconds. The larvae were then shaken in sterile DH2O, removed and 
individually macerated in a 1.5ml microcentrifuge tube. The macerate was serial diluted 
and plated on LB media containing antibiotics selective for the host S, entomophilia strain. 
To assess the stability of the bioassayed plasmid, colonies were patched onto a plate 
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containing antibiotics either selective for the recombinant plasmid or the S. entomophilia 
strain. Identity of plasmids in the recovered strain was checked by restriction enzyme 
profile. 

Nucleotide Sequencing 

5 A 9-kb BamHl --EcoRI fragment derived from the pBM32-8 mutation (Fig 2b) and the 8kb 
HindSL fragment of pBM32 were separately cloned into the appropriate site of the deletion 
factory plasmid pDELTAl. Deletions were generated using the Delection factory™ system 
(GEBCO BRL, MD, USA), as outlined in the manufacturers instructions. To identify the 
precise location of mini-7n70 mutations, the peripheral mini-7n70 BamWi sites were used 
10 in conjunction with the BamYH sites of the pathogenic region to subclone the mini-7niO 
flanking regions into either pACYC184 or pUC19. Sequences were generated using the 
mini-Jn/O specific primer 5' ATGACAAGATGTGTATCCACC3' (Kleckner et al. 1991). 

Plasmids for sequencing were prepared by Wizard® (Promega, Madison, USA) or Quantum 
Prep® (Bio-Rad, California, USA) miniprep kits. Sequences were determined on both 

15 strands, by using combinations of subcloned fragments, custom primers and deletion 
products derived from the deletion factory system (Gibco BRL, Madison, USA). The DNA 
was sequenced using either ^^P dCTP and the Thermosequenase cycle sequencing kit 
(Amersham, Buckinghamshire, UK), or by automated sequencing using an Applied 
Biosystem 373A or 377 autosequencer. Sequence data were assembled using SEQMAN 

20 (DNASTAR Inc., Madison. USA). ORF's were analysed by Gene Jockey. Databases at the 
National Center for Biotechnology Information were searched by using BLASTN and 
BLASTX via the www.ncbi.nIm.gov/BLAST. Searches for DNA palindromes, repeats and 
inverted repeats were undertaken using DNAMAN (Lynnon Biosoft, Quebec, Canada). 
Protein motifs were searched using Blocks (http://www.blocksJhcrc,org/)^ ExPASy 

25 (http:/Avww.expasy.ch/)^ and Gene Quiz (http://columba,ebLac.uk:876S/gqsrv/submit). 
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The sequences determined in this study have been deposited in Gene Bank under Accession 
Number AFl 335 182. 

RESULTS 

Cloning the disease encoding region from pADAP 

5 Previously, Grkovic et al. (1995) have shown taht the pADK-I3 mutation can be 
complemented with the pADAP 1 1 kb Hindm fragment (pGLA-20). However, the pADK- 
10 mutation was unable to be complemented with pGLA-20, In an attempt to isolate the 
region that may complement the pADK-10 mutation the previously described pGLA-20 
derived, pADK-35 null mutation (Grkovic et al. 1995) was used as a selective marker (Fig 

10 1), to select the Bglll fragment encompassing both the pADK-10 and pADK-35 mutations. 
pADK-35 DNA was isolated and digested with the restriction enzyme BglR, The resultant 
digest was ligated into the BaniHl site of bBR322 to form the construct pBG35 (containing 
I2.8kb BglR- mini-7>i70 fragment). pBG35 was placed separately in trans with pADK-10 
and pGLA-20, and the resultant strains bioassayed against grass grub larvae. Results 

15 showed that pBG35 was able to complement the pADK-10 mutant, but was unable to 
induce any symptoms of amber disease when placed in trans with pGLA-20, indicating that 
there must be another region on pADAP needed to induce amber disease. 

Restriction enzyme data of pGLA-20 and pBG35 suggested that the entire pathogenic 
region may reside within one of the large BamHl fragments of pADAP. A cosmid BamHl 

20 library of pADAP was made and screened using the 2.2kb EcoRI fragment derived from 
pBG35 (Fig 1) as the probe. Several probe positive clones were isolated; all shared similar 
restriction enzyme profiles. However, one (designated pMH32) was found to be smaller, 
measuring ooly 23kb in size compared with the 33kb of the other clones (eg. pMH4I; Fig 
lb). The difference between pMH32 and pMH41 was found to be a lOkb deletion at the 

25 left most end of pMH32 encompassing the one HindSL site (Fig 1). E. coli strains 
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containing pMH32 or pMH41 were bioassays against grass grub larvae and found to induce 
the full symptoms of amber disease (that is - gut clearance and antifeeding activity). 
However, about ten days after infection a proportion of grass grubs fed the £*. coli strains 
were found to recover from a diseased to a healthy phenotypc. 

5 The plasmids pMH32 and pMH41 were subsequently introduced into a 5. entomophilia 
strain cured of pADAP (S.6RC) and the strains bioassayed against grass grub larvae. The 
strains gave the same disease progression as wild type and no larvae recovered, suggesting 
that the region cloned in pMH32 contained all the pathogenic determinants of pADAP. 

Effect of copy number and mnx'TnlO insertions in pBM32 on disease-causing ability 

10 To facilitate mutagenesis and assess the effect of copy number on the disease process, the 
23kb BamHI fragment from pMH32 was cloned into the medium copy pJasmid pBR322 to 
give pBM32. A bioassay comparing the ability of pMH32 and pBM32 to induce amber 
disease against grass grub was undertaken. Results showed that there were no visual 
differences in the progression of amber disease between pBM32 and pMH32. The 

15 construct pBM32 was mutated with the mini-rn70 transposon derivative 103, and 
insertions mapped (Fig 2b). Bioassays of E. coli strains containing plasmids of the 
resultant mutants, showed that the disease determinants were confined within a central 
16.9kb region (nucleotides 1955-18937 of SEQ ID NO: 1). 

All strains were non-pathogenic or fully pathogenic, and no partial disease phenotypes such 
20 as antifeeding, or gut clearance were noted. 

To confirm that no sequences at either end of the cloned fragment influenced the disease 
process, several deletion plasmids were made (Fig 2a). The large fragments resulting from 
cleavage of the pBM32 -4, -8, -10, -20, -23, -24 and -35 plasmids with BamHI were cloned 
into the analogues site of pACYC184. The resultant plasmids were transformed into the 
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non-pathogenic 5. entomophilia strain 5.6RKm and assessed for pathogenicity. This 
analysis confirmed that the central 16.9kb region (Fig 2a) was sufficient to induce the 
disease. 

Effect of mm-TnlO insertions in pADAP on disease*causing ability 

5 Grkovic et al. (1995) recombined by marker exchange the pGLA-20 based mutations - 10 
and -13 into pADAP (Fig 2a). When bioassayed, 5. entomophilia strains containing either 
of these mutant plasmids caused a partial condition including cessation of feeding but not 
gut clearance or amber colouration. This was in contrast to the complete abolition of 
disease observed in pADAP-cured 5. entomophilia strains containing mutant pBM32 
10 plasmids with similar insertions. 

To determine the disease phenotype of the pBM32-based insertions in a pADAP 
background, the pBM32 based insertions were transferred into pADAP. pBM32 -1,-2, -4, 
-5, -6, -8, -9, -10, -21, -24, -30, -31 and -35 DNA fragments containing the inserted 
transposon and flanking DNA were cloned as independent fragments into pLAFR3 and the 

15 inserts recombined back into pADAP by marker exchange (Fig 2c). The resultant 
recombinant S. entomophilia strains were checked by Southern analysis to confirm that 
recombination had occurred as expected and no pLAFR3 vector sequences were present 
(data not shown). Mutations that did not affect the disease process in pBM32 also had no 
effect when recombined back into pADAP. However, strains with the pADAP mutants that 

20 totally abolished the disease process when in the pBM32 clone caused non-feeding but not 
gut clearance of the grubs (Fig 2b, c). Hence, none of the pADAP recombinant strains 
completely abolished the disease process. This suggests that, while the 16.9kb fragment 
contains all genes required for pathogenicity, other genes contributing to the antifeeding 
effect are present on some other part of pADAP. 

25 Assessment of plasmid stability during the course of the bioassay showed that greater than 
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90% of the recombinant Serratia strains contained the clone of interest. 
Nucleotide Sequence Analysis of the pathogenic region 

The large BamHI fragment (18937 bp) derived from the pBM32-8 was sequenced on both 
strands using a combination of constructed detections, plasmid subclones and custom made 
5 primers. A total continuous sequence of 18937 bp has been deposited in Gene Bank 
(Accession Number AF135182). Structural analysis of the DNA sequence using 
DNAMAN showed that there was a 12-bp sequence repealed five times between positions 
683 and 743. The repeat is flanked by an upstream 13 base pair palindrome (669-682-bp), 
and a degenerate 34-bp downstream palindrome (765-799-bp)(Fig 2d,e). 

10 Translation of the nucleotide sequence revealed nine significant open reading frames 
(ORP's). These together with their putative ribosomal binding sites and their base 
composition are listed in Table 2. Eight of the ORF's were oriented in the same direction 
and the other two in the opposite direction (Fig 2d). Sequence similarity searches showed 
that the deduced products of seven of these ORF's shared similarity with known proteins 

15 (Table 3). Products of three of the ORF*s showed similarity to different protein 
components of insecticidal toxins oi Photorhabadus luminescents (Bowen et al. 1998). 

These ORF's have been designated sep. (sepA, sepB and sepC) for Serratia entomophilia 
pathogenicity. 

Similarities of deduced amino-acid sequences to proteins in current database 

20 Results of database searches for homologues proteins are listed in Table 4. 

With reference to Fig 2d and Table 4, the following protein similarities were identified: 

The protein product of sepA, had high similarity to the P. luminescents insecticidal toxin 
complex protein TcbA, TcdA, TcaB and TccB. These proteins shared three significant 
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regions of predicted amino-acid similarity, at the amino-terminal region (SepA amino-acid 
residues (121-178), a central region (SepA amino-acid residues 960-1083) and, with 
greatest similarity, at the carboxyl terminus (SepA amino-acid residues 1630-2376) Fig. 4). 
However, there was little amino acid conservation around the putative proteolytic cleavage 
5 site of TcaB, TcbA and TcdA identified by Bowen et al. (1998). SepA also contained a 
region (residues 1057-1345) with weak similarity to the Clostridium bifermentans 
mosquitocidal toxin cbm71 (Barloy et al., 1996). 

SepB and the P. luminescens insecticidal toxin complex protein TcaC shared similarity 
throughout their length, and both SepA and TcaC showed high amino-terminal similarity to 
10 the Salmonella vimlence protein spvB (Gullig et al. 1992) (Fig. 5). The similarity of SepB 
and TcaC to SprB diminishes after SpvB amino acid residue 356. 

SepC showed strong similarity to the amino-terminal of the insecticidal toxin complex 
protein TccC, up to amino-acid residue 663 of SepC. A number of putative bacterial cell 
wall proteins also have high similarity to SepC, including the wall associated protein 
15 precursor A subtilis (WAPA) and members of the E, coli Rhs (recombinant hot spot) 
elements. Strong similarity of SepC was also observed with hypothetical wall-associated 
proteins from Coxiella bumetti and Bacillus subtilis (Table 4). 

The translated sequences of ORFl and ORF2 showed no similarity to sequences in the 
current databases. ORF3 shared significant similarity to the morphogenesis protein of the 
20 Bacillus subtilis bacteriophage B103, a member of bacteriophage muramidase-type lysis 
proteins (Pecenkova et al. 1996). However, relative to size, the gpl9 protein of S. 
typhimurium phage ESI 8 (146 amino-acid residues) or the nucD/regB phage lysozymes of 
5. marcescens (179 amino-acid residues) are more similar. ORF4 showed similarity to E, 
coli bacteriophage N15gp 55 protein, a protein of unknown function (Zimmer et al. 1998). 

25 Located in the same orientation as the sep genes and 134bp downstream of the SepC 
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termination codon is a 204 base pair region assigned ORF5, which has high similarity to a 
S, typhimurium revolvase/invertase protein. However ORP5 is disrupted by two stop 
codons at amino-acid residues 19 and 64, making it unlikely that an active 
resolvase/invertase protein, is encoded by this region. A 256-bp region of encompassed by 
5 ORF5 (17498-17754) showed high similarity (77% identity) to the region (AF020806; 
1629-1885 bp) encoding 5. typhimurium DNA invertase gene (Valdivia et al. 1997) 
suggesting a similar ancestral origin. 

Downstream of ORF5 and oriented in the opposite direction from 18935-18163 was a 870 
base pair rejgion of DNA designated ORF6 whose product showed high amino-acid 

10 similarity over two different reading frames to the insertion element 7591 of £. coli 
(Mendiola et al. 1992). The translated sequence is interrupted at amino-acid residue 149 of 
the 7591 element and later resumed on a second reading frame, before its similarity 
switched back to the original reading frame. Swtiching of ORF*s is a common feature of 
members of the IS3 family where the transposase is encoded by this overlapping ORF's 

15 (Prere et al. 1990). However, the switch back to the initial strand is atypical. ORF6 may 
therefore be a dysfunctional relic of an ancestral 75 element. It is unknown whether ORF6 
contains a ribosomal binding site as its predicted location would lie outside the sequenced 
region. There was no DNA similarity to the 7591 element. 

Analysis fon protein motifs showed that a tripeptide cell-binding motif Asp-Gly-Arg 
20 (RGD), implicated in the binding of various adhesion proteins produced by parasites and 
viruses to eukaryotic cells (Leininger et al. 1991), is present in SepA and the P. 
luminescens TcdA, and TcaB proteins (Fig. 4). The RGD motif is present in cell surface 
adhesions produced by the human pathogen Bordetella pertussis, namely the filamentous 
heamagglutinin (220 kDa) (Relman et al. 1989), and the outer membrane protein pertactin 
25 (69 kDa) (Leininger et al. 1991). These motifs have been implicated in enhancing the 
binding of B. pertussis to eukaryotic cells. Because the RGD motif found in SepA falls in a 
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region of high similarity between SepA and its P, luminescens counterparts, it may play a 
role in meditating the attachment of the protein and/or the bacteria to the insect cell wall. 

The hydropathicity profile of each of the Sep proteins was examined using the Kyte and 
Doolittie algorithm (Kyte and Doolittle, 1982) and compared to the relevant P. luminescens 
5 homologues. None of the Sep proteins contained a positively charged amino terminus 
followed by a hydrophobic region, characteristic of a signal sequence (Gierasch, 1989). 
The profiles of SepA, TcbA and TcdA were very similar (data not shown) and each 
exhibited a steep hydrophilic peak at the carboxyl terminus (residues 2055-2061 of SepA), 
specifically the protein sequence RRRRE (Fig. 4). Although both SepB and TcaC shared 
10 similarity to the Salmonella virulence protein SpvB, the amino-terminil of SepB and TcaC 
were hydrophilic as opposed to the hydrophobic nature of SpvB. The profile of SepC and 
its Photorhabadus counterpart TccC differed in that SepC had a slightly hydrophilic amino- 
terminus, whereas TccC lacked a hydrophilic amino-terminus and had a significantly 
hydrophobic carboxyl terminus from amino-acid residue 717 onwards (Fig. 3). 

15 Analysis to detect repetitive motifs characteristic of the RTX family of toxins (Welch, 
1991) using DOTPLOT showed only P. luminescens TccC contained a plot characteristic of 
a repeat motif present at the carboxy terminal (data not shown). 

Analysis of DNA composition (%GC) and similarity 

Comparisons of the GC content (Table 3) showed that the SepA and SepB genes were more 
20 GC-rich than their P. luminescens counterparts, while SepC and tcaC had similar GC 
content. The high GC content of SepC may be attributed to the close relationship of these 
protein products to the rhs family of wall-associated proteins which have a GC-rich core of 
62% (Wang et aL 1998). Comparisons of the GC content of the Sep genes with that of the 
S. entomophilia genome shows that they are rather similar, suggesting that the sep genes 
25 were not recently acquired by 5. entomophilia. 
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Identification of niini-7>ii0 location by sequence analysis 

Analysis of the insertion points of the previously isolated mini-TnlO insertions (Fig. 2) 
within the putative ORF's (Table 4) revealed that ORF3 and ORF4 were interrupted by the 
-9, -23, -24 (ORF3) and -35 (ORF4) mutations. These insertions had no effect on the 
5 pathogenicity process, suggesting that ORF3 and ORF4 do not play a significant role in 
pathogenicity. However, the pADAP-35 mutation was at the 3' end of ORF4, resulting in 
the truncation of the final 1 1 amino-acid residues of ORF4 (Fig, 4), which may not have 
affected protein function. Further mutagenesis of ORF4 is therefore required to confirm 
that it has no role in pathogenicity. The mutations that caused loss of pathogenicity all 
10 resided within SepA, SepB or SepC, No mutation mapped to ORFl , ORF2 or ORF5. 

Complementation analysis of the sep proteins 

Following sequence data each of the Sep ORF's were excised as closely as possible with 
restriction enzymes, placed into pLAFR3 and placed in trans with the appropriate pADAP 
mutation. Complementation of SepA was undertaken through the use of the 8.5 kb HindSl 
15 clone (pMH45) which encompasses both ORFl and SepA. SepB was excised as a 5.4 kb 
Stul fragment and SepC was excised as a 4.6 kb fragment using one of the peripheral; 
BamVa sites from the pBH32-13 mutation and the Stul site of pBM32 (Fig. 2b). 

Complementation analysis showed that pLAFR3 based SepB and SepC are able to 
complement their mutated pADK- counterparts. Grkovic et al. (1995) had already 
20 previously shown that SepC could complement itself. However, this was achieved through 
using the entire 1 1 kb HiruHSl, pGLA-20 fragment. 

Whether SepA is able to complement itself has yet to be fully established. It was found that 
-98% of the pMH45 construct was lost during the course of the bioassay. This latter result 
was sporadic and occasionally a repeated experiment would show the presence of diseased 
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grubs. Analysis of the macerates of these grubs showed that pMH45 was present indicating 
that pMH45 can possible complement SepA. However before further complementation 
analysis of SepA can be undertaken, measures to ensure the complementation plasmids 
stability are needed. 

5 Discussion 

The large conjugative plasmid, pADAP, of S. entomophilia encodes the genes responsible 
for cessation of feeding and gut clearance, characteristics of amber disease in the New 
Zealand grass grub C zealandica. This plasmid is present in all 5. entomophilia and 5. 
proteamaculans strains capable of causing amber disease (Glare et al, 1993) and had been 

10 implicated in disease processes (Grkovic et al. 1995). The applicant has defined a 16.9 kb 
region of kADAP that is sufficient to confer pathogenicity towards C. zealandica on 
pADAP-cured strains of 5. entomophilia and on strains of £. coli. Hence, the region 
confers all the essential pathogenicity genes of S. entomophilia responsible for amber 
disease. Nucleotide sequence and mutagenesis analysis of the region revealed three genes, 

15 SepA, SepB and SepC, that together are sufficient for pathogenicity. Mutations in any of 
the three genes completely abolished the disease process and partial disease states were not 
detected, suggesting that the three genes may interact to exert an effect. 

The 23-kb region cloned into pBR322 to make pBM32 conferred pathogenicity in pADAP- 
cured 5. entomophilia strains with all symptoms of amber disease being observed. 

20 Insertion mutants in pBM32 that abolished pathogenicity were transferred to pADAP. The 
resultant strains showed a partial disease phenotype, including anti-feeding but not gut 
clearance, suggesting that an additional anti-feeding gene may be present elsewhere on 
pADAP. The occurrence of two different anti-feeding genes on pADAP also supports data 
of Grkovic et al. (1995) who found that suppression of feeding was stronger in the wild- 

25 type pADK*6 strain, compared to the partial disease state (pADK-lO, pADK-13) of 
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inducing antf-feeding but no gut clearance. A putative anti-feeding gene, amb2, has already 
been isolated from the genomic DNA of 5. entomophilia (Nunez- Valdez and Mahanty, 
1996). Recent data indicate that the amb2 locus resides at an as yet to be identified location 
on pADAP that is remote from the region identified herein (Hurst, unpublished data). 

5 Sequence analysis and comparison of the products of the sep genes showed that they share 
significant similarity to the proteins TcbA (TcdA, TcaB, TccB), TcaC and TccC that 
comprise the toxin complexes of P. luminescens. Like the P. luminescens genes that sep 
genes of Serratia share a similar organisational pattern of three genes ordered in succession 
in the same orientation, and opposed by a terminil gene transcribed in the opposite 

10 direction. However, the order of sep genes differ, are slightly smaller in size, and comprise 
constituents of each of the P, luminescens loci tea (tcaB=sepA. tcaC=sepB), luminescens 
toxin gene ted (Ensign et al. 1997) is also similar to SepA. The similarity shared between 
the sep and tc gene products suggests that they are members of a new family of insecticidal 
toxins. The lack of DNA similarity as opposed to protein similarity between sep and P. 

15 luminescens tc genes together with the diffemce in GC content of the sepA and sepB genes 
compared to the tc genes, suggests that these genes were present in the common 
enterobacterial ancestor of P. luminescens and 5. entomophilia and were not acquired by a 
more recent horizontal transfer event. 

The Photorhabadus toxins were isolated as a composite of proteins which are hypothesised 
20 to interact synergistically to form a toxin complex. The toxins are also able to exert an 
anti-feeding effect (Bowen et al. 1998; Bowen and Ensign, 1998). This is consistent with 
the results we obtained with the sep mutants. pADAP-cured S. entomophilia strains 
containing the pathogenicity clone pBM32 exert an anti-feeding effect on the grass grub 
and individual mutations within any of the sep genes have an identical phenotype, 
25 completely abolishing pathogenicity. The Photorhabadus toxins have a wide host range, 
affecting Lepidoptera, Coleoptera and Dictyoptera and undergo post translational 
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proteolytic processing (Bowen et al. 1998). No similarities of sep proteins were found to 
the Photorhabadus toxin component TccA, and only the amino-terminus of TcaA shared 
similarity to SepA. This and the difference in the hydrophobicity profiles of SepC and 
TccC, may account for specificity of the sep proteins towards C. zealandica. However the 
5 sep proteins have yet to be purified and it is unknown whether the sep genes are expressed 
when 5. entomophilia is ingested by other insects. Therefore the possibility that these 
newly-described toxins may exhibit a broader host range cannot be ruled out. 

The Photorhabdus toxin TcbA shares weak similarity to the Clostridium difficile A and B 
toxins (Bowen, 1998), but no such similarities were found to SepA. C difficile A and B 

10 toxins belong to the RTX (repeats in toxin) family of toxins which are noted for the 
presence of several carboxyl terminal repeats (von Eichel-Streiber et al. 1992). A search of 
the sep proteins and their P. luminescens homologues for protein repeats showed that only 
the P. luminescens TcaC protein contained a repeat-type signature. The TcaC carboxy- 
terminal repeat bears little resemblance in size or number of repeats found in RTX toxins 

15 (von Eichel-Streiber et al. 1992). SepA does not show weak similarity to the mosquitocidal 
toxin Cbm71 of C. bifermentans (Barloy et al. 1996). However when this region is 
compared with the relevant Photorhabdus homologues, it is a region with little similarity. 

SepB has strong similarities to both P. luminescens TccC and the Salmonella virulence 
gene product SpvB (Gulig et al. 1992). SpvB is believed to enhance the survival of virulent 

20 Salmonella in macrophages (Libby et al. 1 997). It has been suggested that TcaC may act by 
attacking insect haemocytes (Bowen et al. 1998). However, haemocytes reside within the 
insect haemocoel and 5. entomophilia does not invade the haemococl until late in the 
infection process (Jackson et al. 1993), suggesting that SepB may act in some other way. 
The similarly of SepB and TcaC is high to SpvB but diminishes ten amino-acid residues 

25 upstream of the proline-rich region found in SpvB that is postulated to divide the protein 
into separate domains (Roudier et al. 1992). This may indicate a vital role for the amino- 
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terminus of both SepB and SpvB in interacting with an evolutionarily-conserved eukaryotic 
protein. 

The SepC protein shows high similarity to a family of cell wall-associated bacterial proteins 
such as the B. subtilis wall-associated protein (WAPA) and members of the E. coli rhs 
element family. The function of the Rhs proteins has yet to be established, but they are 
believed to be cell surface ligand-binding proteins (Hill et ai. 1994). The Rhs proteins and 
the B, subtilis was-associated protein contain a characteristic repetitive peptide motif, but 
no such motif was observed in SepC. A feature of rhs elements is the presence of a 
downstream IS element (Wang et al. 1998). A degenerate IS91-type transposase element 
(ORF6) is present downstream of SepC. The IS91 element has been found associated with 
plasmids or chromosomal genes involved in a-haemolysin synthesis, and has been 
postulated to play a pivotal role in the spread of the a-haemolysin genes by means of the 
IS91 -mediated recombinational activity (Zabala et al. 1984). It seems possible an IS 
element adjacent to SepC may have been involved in the acquisition of the sep genes by S, 
entomophilia. 

Blackburn et al. (1998) undertook histological examinations of the lepidopteran Manduca 
sexta after treatment with the P. luminescens Tea toxin complex introduced by feeding or 
haemcoelic injection. They found blebbing of the midgut epithelium into the lumen, 
resulting in lysis and formation of cavities. Similar histological studies have been 
undertaken at various stages throughout the infection cycle of S. entomophilia in C. 
zealandica, and reveal a visible deterioration in the number of fat cells to almost minimal 
levels, and an emptying of the larval gut However no blebbing of the midgut epithelium 
was observed (Jackson et al. 1993). 

The S. entomophilia pathogenicity region endows pathogenicity on members of the 
Enterobacteraceae such as Klebsiella spp,, Enterobacter agglomerans, E. coli, and Serratia 
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species (Glare et al. 1996). From this we can infer that the Sep proteins are the major 
virulence determinants, that the promoters of the sep genes are expressed constitutively or 
under the control of conserved regulatory genes, or a negative regulatory gene present in the 
pathogenicity region, and that export of the toxin proteins is carried out by a conserved 
5 chromosomally encoded system, or is an intrinsic property of the sep proteins. The Sep 
proteins have no obvious amino terminal signal sequences, a facet shared with E-Group 
colicins. The release of cloacin DF13 is mediated through a small lipoprotein designated 
BRP, for bacteriocin-release protein. Low level expression of BRP in conjunction with 
phospholipase A leads to the release of cloacin DPI 3. along with bacterial peripJasmic 

10 proteins. However if expressed in high amounts, BRP causes cell death by cell lysis (vad 
der WaU 1998). The close proximity and similar orientation pattern of ORF3 to the sep 
genes indicate that ORF3 may have an as yet to be determined important functional role. 
Protein similarity searches show that it has high similarity to the bacteriophage lysozyme 
family. In relation to amino-acid size, ORF3 closely resembles the LZBP22 lysozyme of 

15 the Salmonella P2 bacteriophage, a protein essential for the lysis of the bacterial cell wall 
(Rennell and Poteete, 1985). It is possible that ORF3 may facilitate the release of the sep 
proteins by lysing the bacterial cell wall. A low level expression of ORF3 might, as in the 
case of BRP, allow the passage of the sep proteins across the cell wall without causing cell 
death. The reason that the pBM32-9 and -24 mutations were unable to abolish the disease 

20 process could be due to a masking of ORF3 function by natural cell lysis of the bacteria. 

A region of repetitive DNA was identified between nucleotides 683 to 743, centered within 
a 1.2-kb AT rich stretch of DNA that contains no potential ORF's. The repeat motif is 
flanked by an upstream 13-bp palindrome and a degenerate downstream 33-bp palindrome. 
Repeats have been found to be common sites for recombination (Allgood et al. 1988), or to 
25 facilitate the binding of proteins. A 66-bp DNA sequence termed the rsk element for 
reduced serum killing, of the S. typhimurium 95-kb virulence plasmid, comprises of a series 
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of direct 10-bp repeats with a 21 nucleotide periodicity. The rsk element is believed to 
titrate out a rran^-acting factor, enhancing the expression of the Salmonella serum 
resistance gene (Vandenbosch et al. 1989). It is not known whether these repeats and/or 
flanking palindromes have a role in the pathogenicity process. The deletion derivative 
5 pAC24, which encompasses this region, was still pathogenic towards the grass grub. 
However, this deletion could also unknowingly remove the complete regulatory circuit of 
the pathogenicity region, leading to constitutive expression. 

THE ARABINOSE EXPRESSION SYSTEM 

Methodology 

10 Using the polymerase chain reaction (PGR) the initiation codon ATG of the three sep genes 
{sepA, sepB and sepC) were individually placed into the unique Ndel site (restriction 
enzyme site CATGG) of the HIS-tag arabinose expression vector pAV2- 10 (obtained from 
Chuck Shoejnaker -AgResearch). Because large proteins i.e. greater than 50 kda arc 
limited in their ability to bind to HIS tag affinity columns the carboxyl terminus of each of 

15 the Sep proteins did not need to be in frame with the HIS-tag site. Instead wild type DNA 
(non PCRd) containing a downstream chloramphenicol resistance gene was ligated into the 
appropriate restriction enzyme site {sepA Sunl\ sepB HindJH; sepC BstXJ) of the pAV2-10- 
sep derived vectors:- 

-the use of the chloramphenicol resistant marker provided by the vector pACYC184 
20 enhances the stability to each of the expression constructs i.e. -the antibiotic ampicilin to 
which the pAV2-10 is resistant too is cleaved in the media to an inactive form leading to 
possible plasmid free segregants arising. Conversely the antibiotic chloramphenicol is not 
cleaved heightening the level of plasmid stability under conditions of arabinose induction. 
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To validate the legitimacy of the fused genes to the arabinose expression vector, PGR 
generated products and the ligation junctions were verified by DNA sequencing. 

Concurrent to this the sepB and sepC genes were placed as derived from pADAP 
downstream of sepA, Also sepA, sepB and sepC were placed as in pADAP downstream of 
5 orf3. This simulated wildtype conditions (i.e. the arrangement of the sep genes on pADAP) 
and hopefully get the production of the sep genes and the complex driven off the one 
upstream promoter, A method which Western analysis has shown to be successful -with 
moderate levels of sepA, sepB and sepC being detected. 

The arabinose expression system is one of the tightest systems known with almost complete 
10 abolition of gene product under arabiniose free conditions Guzman et aL (1995), this 
abolition can be enhanced by providing glucose to the medium. In contrast providing 
arabinose at the concentration of 0.2% will switch the arabinose promoter on express any 
genes under its control e.g. sepA etc. Typically an overnight culture of the E. coli strain 
was set up the next day an 100 \\\ of the culture was suspended in fresh media 
. 15 supplemented with chloramphenicol (30 jlig/ml) the culture was grown until an OD of 400 
at which time arabinose was added to the culture to a final concentration of 0.2% and the 
culture left shaking at 30 °C for 18 hours. 

To date Western analysis has shown that each of the proteins is expressed and expressed to 
its correct predicted size: 



20 



SepA 262.7 kdal 



SepB 156.6 kdal 



SepC 107 kdal 
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SepC is expressed at high levels with minor levels of proteolytic cleavage. However both 
SepA and SepB though expressed are cleaved in high amounts by endogenous E. coli 
proteases. Alternative strains of E. coli are going to be assessed for loss of proteolytic 
activity against SepA and SepB 

5 It has also been shown that placing all three of the sep genes under the control of a single 
arabinose promoter will result in the production of basil levels of the SepA, SepB, SepC 
toxin complex. 

Each of the following Coleopteran species were mouth injected with 3-5 Jil of an overnight 
suspension of induced bacteria {E. coli strain DHBlOl) containing either SepA, SepB and 
10 SepC or orf3^. SepA, SepB and SepC. 

Each larvae was then given a 3mm^ pi^e of carrot coated with a 50% solution (dHiO) of 
arabinose. Observations were noted each day and the larvae refed with a 3mm^ piece of 
carrot coated with a 50% solution (dH20) of arabinose 

Red headed cock chaffer 

15 Tasmanian grass grub 

Odontara 

Grass grub (positive control) 

Under these Conditions it has been found that the arabinose expressed toxin complex SepA, 
SepB and SepC is active against grass grub but not any of the other species of scarabs 
20 tested (see above). It is therefore thought unlikely that the toxin complex will have activity 
to other insect orders. 
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Summary 

The bacteria Serratia entomophilia and S. proteamaculans cause amber disease in the grass 
grub, Costelytra zealandica (Coleoptera: Scarabaeidae), an important pasture pest in New 
Zealand. Larval disease symptoms include amber colouration, clearance of the gut and 
5 rapid cessation of feeding, before eventual death. The region containing pathogenic 
determinants of the disease has been cloned, and further defined by mutagenesis and 
deletion analysis to a 16.9 kb region. Sequence analysis of the minimal pathogenic 
encoding region showed significant protein homology, but little sequence homology to a 
group of newly described toxins from a member of the Enterobacteriaceae, Photorhabadus 
10 luminescens. This pathogenicity-encoding region from 5. entomophilia plasmid pADAP is 
the subject of the invention. The proteins encoded by the genes (sepA, sepB, sepC) within 
the 1 6.9 kb region can be used for insect control whether as an inundative pesticide, within 
baits or expressed in other organisms such as plants or microbes. 

Aspects of the present invention have been described by way of example only and it should 
15 be appreciated that modifications and additions may be made thereto without departing 
from the scope thereof as defined in the appended claims. 
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Tabic 1 Bacterial strains, plasm Ids and bacteriophage used In the study 



Bacteria 

Escherichia coii 
DH5a 



Description 



DHIOB 



DFl 

MC1061 
MC4100 



XLl-BlueMRA 



F(|)80d /acZpMlS p(/acZY A-argF)U 1 69 recA\ 
endM supE44 

FmcrA p(mrr'hsdRMS-mcrBC)^ZQd iacZpMlS 
placX74 endAI recAl deoKp(ara, leu) 7697 
aroD 1 39 galU galK ttupG rpsL X, 
yS transposase(l>i/7^) 

siqP HsdR mcrB araDl39 p(qraA BC'leu)7679 
p/acX74 galUgalK rpsL thi 
aroDISP p(iacZYA-argF)Ul69 rpsLlSO 
St* reUl flbBSSOl deoCJ ptsF25 
rbsH 

p(mcr^)183 p(mcrCB'hsdSAiR'mrr)173 endAI 
sifE44 thi' J reAl gyrA96 reUI 
Serratia eniomophila 
AIM02 Ap*, pADAP» pathogenic. 

5,6 heat cured pADAP minus derivative of A1M02 

5,6RC Cm* recA' pADAP minus strain 

5.6RK Kn* recA' pADAP minus strain 

Plasmids 

pACYCI84 Cm"Tc* 

pADAP Amber disease associated plasmid 

pBR322 Ap^.Tc* 

pBKf32 23-kb BomHI fragment fh>m pMH32 cloned in 

^ pBR322 

pBM32- 1 -40 pBM32 containing mini-7>i/0 insertions 
pDELTAl Ap*. Sm*, Kn*, sucrose* 

pLAFR3 Tc* pRK290 with Xcoj, /acZa and multi- 

cloning site from pUC8. 
pRK20 1 3 IncP. Kn* Tra RK2 repRK2 repEl 

pGLA20 1 0.6-kb HindlVL pADAP fragment cloned in 

pLAFR3 

pACp4 l9-kb BamVa. fragment from pBM32-4 cloned in 

pACYC184 

pACp8 17-kb Bam\Xi fragment from pBM32-8 cloned in 

pACYCl84 

pACplO 19.5-kb Bamm fragment from pBM32-lO 

cloned in pACYCl84 
pACp20 20-kb BamWi fragment from pBM32-20 cloned 

in pACYC184 

pACp23 2 1 -kb Bamm fragment from pBM32-23 cloned 

in pACYC184 

pACp24 21 .2-kb Bamm fragment from pBM32-24 

cloned in pACYC184 
pADK-10 pADAP::mini-TnyO insertion in 10.6-kb ^mdlll 

fragment, Kn* non-pathogenic 
pADK-13 pADAP::mini-Tn70 insertion in 10.6-kb 

Hindm fragment, Kn* non-pathogenic 
pADK-35 pADAP::mini.Tn/0 insertion in 1 0.6-kb HmdUl 



Reference 



Hanahan(1983) 

Lorow and lessee, 
(1990) 

Gibco BRL 
Casadaban and Cohen, 
(1980) 

Silhavy et al, 
(1984) 

Stratagene 



Grimontera/, (1988) 
Glare e/ a/. (1993) 
Grkovic etal. (1996) 
this study 



Chang and Cohen, 
(1978) 

Glare era/. 1993) 
Bolivar e/tf/. (1977) 
this study 

this study 
Gibco BRL 

Staskawicz et aL (1987) 

Ditta e/ a/. (1980) 
(^rbett (unpublished) 

this study 

this study 

this study 

this study 

this study 

this study 

Grkovic a/. (1995) 
Grkovic era/.(1995) 
Grkovic a/. (1995) 
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pMH32 
pMH4I 
pBM32 
pUC19 



Bacteriophage 
XNKI316 



fragment, Kn'^, pathogenic 

23-kb BamHl frgament of pADAP cloned into 

pLAFR3 

33-kb DamHl fragment of pADAP cloned into 
pLAFR3 

23-kb BamHl fragment of pMH32 cloned into 
pBR322 

Ap*^, iacZxi, multi-cloning site 



mini-Tn/O derivative 103 donor X.b522 cl8S7 
PamSO ninS 



this study 
this study 
this study 

Yannish-Perron, et al. 
(1985) 

Kl6cknere/<a/. (1991) 



Table 2 FosUioa of genes aod features of the predicted gene products encoded by sep genes 



ORF 


Putathre ribosome-binding site' 


Longest potential coding 
reeton 


(p. biftUnseens 
homologue, VoGC) 


Start at 
nucleotide 


Stop at nt 
(ORF size 
bp) 


sepA 


ATGSSACCATCAACGTAATGAA 
TCAGG 


2413 


9547 

ai3i) 


54 

0cbA,43:(cdA, 44) 


sepB 


CGAGGAGACTGAGCATGCAA 


9598 


13885 
(4287) 


58 

(rcarC. 51) 


sepC 


ACAGGAGATCACATGAGC 


14545 


17467 
(2922) 


55 

(rccC. 54) 


ORFl 


CATAG^QACTGTCGCTATGTTA 


1287 


1587 
(300) 


39 


ORF2 


TTGGAGAATAACCGCCATGTT 


1590 


1863 
(273) 


39 


ORF3 


GGGGGAGAAAAATGAAG 


1860 


2294 
(435) 


51 


ORF4 


TGACTGGGAAGGAGGGGGGGAC 
GGTGATGAGT 


13908 


14483 
(576) 


60 


0RF5 


TAACeASACrnTTAGCAAAAT 
GGCACTTT 


I76I-I75S, 1755-1773 


7 


ORF6 


GAGCATGGC-Mini-Tn/O^S ♦ 


18934-18064 


? 



* Putative ribo^me-binding sites are underlined, and potential start codons are in bold&ce; nt, nucleotidiss; ? 
degenerate or incomplete ORF. * ORF transcribed in opposing direction. 



Table 3. Comparisons of GC content between the and P, luminescen genes 



Sep (%0C) 


P. Jumineacea tcMcln (%GC} 


aepA (S4%) 


tchA (43%) todA (44%) 


eepB (58%) 


tcaC (51%) 


aepC (SS%} 


eccC (54%) 
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Table 4. Stmilaritics f pr ducts of putative ORF's to protein sequences in the database 



detected using Blay tP 



ORF 
(a.asize) 


Protein 
homo- 
logue (a.a 
size) 


Degree of simnarity 
Voidentity/Vosimilarity 
(over) a.a residue - a.a 
residue 


Function of the homologous 
protein 


Organism 


Blast score 
Reference* 


SepA 


TcbA 
(2504) 


34/50(1675)41-1628* 
57/72 (751) 1630-2374* 


insecticidal toxin complex 
protein 


Photorhabdus 
luminescens 


0.0 

AF047457 




TcdA 
(2405) 


40/55 (2458)* 


insecticidal toxin complex 
protein 


P. luminescens' 


0.0 

Ensign et a/.. 
(1997) 




TcaB 

dim 


38/54 (764) 1625-2374* 
29/50(281)936-1198* 


insecticidal toxin complex 
protein 


P. luminescens 


AF046867 




TccB 
(1565) 


3wSI (859) 1575-2373* 
31/51(289)930^1204* 


insecticidal toxin complex 
protein 


P, luminescens 


AF047028 




TcaA 
0095) 


36/56 (90) 94-183* 
18i39 (530) 435-928* 


insecticidal toxin complex 
protein 


P. luminescens 


le* 

AF046867 




TccA 
^965) 


27/45(186) 115-280* 


insecticidal toxin conqilex 

protein 


P. luminescens 


5c** 

AF047028 




Cbin71 
(613) 


^AiM^ 

24/41 (199) 1057-1250* 


Mosquitocidal toxin Cbm71 


Clostridium 
bifermentdns 


g2127309 


SepB 
0428) 


TcaC 
0485) 


49/63 (1276) 1- 1 263* 
64/78(152) 1270-1421* 


insecticidal toxin complex 
protein 


P, luminescens 


0.0 

AF046867 




SpvB 
^591) 


40/52 (357) 9-365* 


Salmonella virulence protein 


Salmonella 
typhimurium 


4c*^ 
S22664 


SepC 
(938) 


TccC 
0043) 


53/66 (836) 3-782* 


insecticidal toxin complex 
protein 


P, luminescens 


0.0 

AF047028 




SC2H4.02 
ai83) 


23/34 (639) 68-677* 


Hypothetical wall associated 
protein 


Streptomyces 
coelicolor 


2e"« 

A1j031514.1 




WapA 
(2334) 


22/34(430)255-677* 
20/36(613)48-625* 


Wall associated protein 
Precursor 


B. subHlis 


2e' 

S32920 




Y15898 
(334) 


21/34(542)181-684* 


hypothetical wall associated 
protein 


Coxiella burnetii 


9e^ 

V15898 




Rhscore 
O420) 


21/36(285)35-300* 


ivila core proicul 


c coil 


30"* 

AF044501 


ORF3 
fl44) 


BB103G 
(263) 


45/62(142)1-139* 


morphogenesis protein of 
bacteriophage B103 


Bacillus subtUis 


3e-" 

CAA67646 




LZBP22 
(146) 


46/61 (139) 1-143 


Phage P22, lysozyme (E 
3.2.1.17) 


Salmonella 


le-** 
138699 


ORF4 
091) 


if 


28/42(188)1-184* 


bacteriophage Nl 5 protein 


E, coli 


le^ 

AF064539 


ORFS 
(236) 


SprA 


75/79(68) 1-68 ♦ 


Resolvase/inveitase homologue 


S, typhimurium 


7e-»^ 

AF029069 
AF020806 


ORF6 
(310) 


ISP/ 


39/56(94) 130-I97* -r 
39/58 (94) 224-3 184 -2* 
30/48 (76) 319-395^ V 


IS97 tzansposase 


E» coli 


4e"" 
S23782 



Percent identities and similarities were calculated in relation to the deduced gene products of the sequenced 
ORF. *indicates position of amino-acid similarity in relation to sequence generated in this study. 4 indicates 
position of amino-acid similarity in relation to data base protein sequence. * reading fiame. * stmilazities were 
considered potentially significant if the BlastP score exceeded e '. 
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Table 5 Positions of iiiiiii*Tn7(? insertions 





ORF 


Position downstream of 


insertion U 




initiation codon (bp) 


9/23 


ORF3 


120 


24 


ORF3 


345 


4 


sepA 


747 


27 


sepA 


1037 


40 


sepA 


1097 


6 


sepA 


1727 


38 


sepA 


2887 


2 


sepA 


3197 


5 


sepA 


3737 


3 


sepA 


3697 


19 


sepA 


3697 


30 


sepA 


4467 


37 


sepA 


4467 


31 


sepA 


4627 


12 


sepB 


182 


22 


sepB 


172 


11 


sepB 


362 


10 


sepB 


2162 


35 


ORF4 


557 


13 


sepC 


2525 


8 




18937 


ORF4/-35 junction GGG CXKl TGA TGA ATC 
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The Claims Defining the Invention are: 

1. A purified and isolated nucleic acid molecule comprising a nucleotide sequence of SEQ ID 
NO: 1 that encodes at least one oh 

(i) an insecticidal protein complex* or 

(ii) a funcdonal fragment of said complex, or 

(iii) a neutral mutation of said complex, or 

(iv) a homolog of said complex » 

each of which have at least 75% nucleic acid homology to SEQ ID NO: 1 and are capable 
of hybridising with said nucleic acid molecule under stringent hybridisation conditions. 

2. A purified and isolated nucleic acid molecule as claimed in Claim 1 comprising the 
nucleotide sequence 1995-1 8937 of SEQ ID NO: 1 . 

3. A purified and isolated nucleic acjd molecule as claimed in Claim 1 comprising one or 
more of the nucleotide sequences 241 1-9547, 9589-13883 or 14546-17467 of SEQ ID NO: 
1. 

4. A purified and isolated nucleic acid molecule as claimed in Claim 3 comprising all of 
nucleotide sequences 241 1-9547, 9598-13884 and 14546-17467 of SEQ ID NO: .1. 

5. A purified and isolated nucleic acid molecule as claimed in Claim 1 comprising a sequence 
of SEQ ID NO: 1, operably linked to at least one further nucleotide sequence which 
encode an insecticidal protein. 

6' A purified and. isolated nucleic acid molecule as claimed in Claim 2 comprising 
nucleotides 1955-18937 of SEQ ID NO: 1, operably linked to at least one further 
nucleotide sequence which encode an insecticidal protein. 
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7. A purified and isolated nucleic acid molecule as claimed in Claim 3 comprising a sequence 
of SEQ ID NO: l,or one or more of nucleotides 2411-9547, 9598-13884 or 14546-17467 
of SEQ ID NO: 1, operably linked to at least one further nucleotide se(^ence which 
encode an inseciicidal protein. 

8. A purified and isolated nucleic acid molecule as claimed in any one of claims 4 through 6 
■wherein the said nucleotide sequence includes the nucleotide sequence which codes for at 
least one of the Bacillus delta endo toxins, vegatativc insecticidal proteins (vips), 
cholesterol oxidases, Clostridium bifermentens mosquitocidal toxins and/or 
Photorhabadids iuminescens toxins. 

9. A purified and isolated nucleic acid molecule as claimed in claim 1 wherein nucleic acid 
molecule may comprise DNA, cDN A or RNA. 

t 

10. A purified and isolated nucleic acid[ molecule as claimed in claim 1 wherein the nucleic 
acid molecules said £ragn>ent. neutral roucarion or homolog thereof capable of hybridising 
to said nucleic acid molecule, hybridise to the nucleodde sequence of SEQ ID NO: U or 
nucleotides 1955-18937, 2411-9547, 9598-13884 or 1454^17467 of SEQ ID NO: 1 if 
there is at least 75% or greater identity between the sequences. 

11. A purified and isolated nucleic acid molecule as claimed in claim 1 wherein chp nucleic 
acid molecule may be isolated f^om Serratia entomophUa or Serratia proteamacuUms 
strains of bacteria. 

12. A recombinant expression vector(s) containing the nucleic acid molecule as claimed in 
Claim 1 and host transformed with the vector expressing a polypeptide. 

13. A recombinant expression vcctor(s) as claimed in claim 1 1 wherein the vector is selectable 
from any suitable natural or artificial plasmid/vector. 

14. A recombinant expression vector(8) as claimed in claim 13 wherein said suitable natural or 
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artificial plasmid/vcctor, including, pUC 19 (Yanoish-Pciron et al. 1995)» pProEX HT 
(GibcoBRL. Gaithersburg, MD, USA), pBR322 (Bolivar et al. 1977), pACYC184 (Chang 
et al. 1978). pLAFR3 (Staskowicz ct al. 1987), 

15. A polypeptide resulting from the transformation or transfcction of a host cell with a 
.recombinant expression vector as claimed in any one of Claims 12 through 14. 

1 6. A method of producing a polypeptide of claim 15 comprising the steps of: 

(a) culturing a host cell which has been transformed or traxisfected with said vector as 
defined above to express the encoded polypeptide or peptide; and 

(b) recovering the expressed polypeptide or peptide. 

17. *The use of a ligand that binds to a polypeptide of claim 15 to isolate and/or identify the 
polypeptide of claim 15, 

1 8. An antibody or antibody binding fragment that binds to a polypeptide of claim 15. 

19. Probes and primers comprising a firagment of the nucleic acid molecule as claimed in 
Claim 1 wherein said fragment is hybridisable under stringent conditions to a native 
insecticidal gene sequence. * 

20. Probes and primers comprising a fragment of the nucleic acid molecule as claimed in 
claim 19 wherein said probes and primers enable the structure and function of the gene to 
be determined and horaologs of the gene to be obtained from bacteria other than Serratia 
sp. 

21. A polypeptide as claimed in Claim 15 wherein the polypeptide has insecticidal activity 
encoded by the nucleic acid molecule of claim 1, or a functional fragment, neutral 
mutation or homolog thereof 

22. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 
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comprises the amino acid sequence of SEQ ID NO: I or a functional fragment, neutral 
mutation or homolog thereof. 

23. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 
comprises amino acids 32-51 18 of SEQ ID NO: 1. 

24. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 
comprises at least one amino acid sequence of SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID 
NO: 4; SEQ ID NO: 5 or SEQ ID NO: 6. 

25. A polypeptide having insecticidal activity as claimed in claim 24 wherein the polypeptide 
preferably comprises amino acid sequence SEQ ID NO: 4; SEQ ID NO: 5 and SEQ ID 
NO: 6. 

26. A polypeptide having insecticidal activity as claimed in claim 24 wherein the polypeptide 
preferably comprises all of SEQ ID NOs: 2-6. 

27. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 
is obtained by expression of a DNA sequence coding therefore in a host cell or organism. 

28. A polypeptide having insecticidal activity as claimed in claim 27 wherein the polypeptide 
comprises the amino acid sequence of SEQ ID NO: 1 linked to at least one further amino 
acid sequence encoding an insecticidal protein. 

29. A polypeptide having insecticidal activity as claimed in claim 28 wherein the at least one 
further amino acid sequence includes the amino acid sequence which codes for Bacillus 
delta endo toxins, vegatative insecticidal proteins (vips), cholesterol oxidases, Clostridium 
bifermentens mosquitocidal toxins and/or Photorhabadus luminescents toxins. 

30. A polypeptide having insecticidal activity as claimed in claim 28 wherein the polypeptides 
comprise at least 50%, preferably 60%, more preferably 70% and most preferably 90-95% 
or greater identity to SEQ ID NO: 1. 
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31. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 
is produced by expression of a vector comprising the nucleic acid of SEQ ID No: 1 or a 
functional fragment, neutral mutation or homolog thereof, in a suitable host cell. 

32. An insecticidal composition comprising at least the polypeptide as claimed in claim 21 and 
an agriculturally acceptable carrier. 

33. An insecticidal composition as claimed in claim 32 wherein more than one polypeptide is 
included in the composition. 

34. An insecticidal composition as claimed in claim 32 or 33 wherein the composition 
comprises additional pesticides, including compounds known to possess herbicidal, 
fungicidal, insecticidal or nematicidal activity. 

35. An insecticidal composition as claimed in claim 34 wherein the composition comprises 
other known insecticidally active agents, including Bacillus delta endo toxins, vegatative 
insecticidal proteins (vips), cholesterol oxidases, Clostridium bifermentens mosquitocidal 
toxins and/or Photorhabadus Iwninescents toxins. 

36. A method of combating pests, said method comprising applying to a locus, host and/or the 
pest, an effective amount of the polypeptide as claimed in Claim 21 that has functional 
insecticidal activity against said pest. 

37. A method of inducing amber disease or like condition in insects comprising delivery to an 
insect an effective amount of the polypeptide as claimed in Claim 21 that has functional 
insecticidal activity against said insect. 

38. A method of inducing amber disease or like condition in insects as claimed in claim 37 
comprising delivery to an insect an effective amount of the polypeptide wherein the insect 
is selected from the order comprising Coleoptera. 

39. A method of inducing amber disease or like condition in insects as claimed in Claim 38 
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comprising delivery to an insect an effective amount of the polypeptide wherein the insect 
includes Costelytra zealandica (Coleoptera: Scarabaeidae). 

40. A method of delivering the insecticidal polypeptide to induce amber disease or like 
condition in insects including delivery of the insecticidal polypeptide as claimed in Claim 
39 to the insect by any one of presenting the insecticidal polypeptide orally as a solid bait 
matrix, as a sprayable insecticide sprayed onto a substrate upon which the insect feeds, 
applied directly to the soil subsurface or as a drench or is expressed in an transgenic plant, 
bacterium, virus or fungus upon which the insect feeds. 

41. A transgenic plant, bacterium virus or fungus, incorporating in its genome, a nucleic acid 
molecule as claimed in Claim 1 for providing the plant, bacterium virus or fungus with an 
ability to express an effective amount of an insecticidal polypeptide. 
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ccgttagcta taaatatgat atttaaatct gtatttttat ataaaaccag tttatgatgc 1080 

tggattggitc attaaagtcg ttatatgtga tcgttatctg tcattgattg gtigtttaatc 1140 

ttttattctt ccagtgaggt ttcaggggga atgtattggg taatcatact catgtcattt 1200 

gttgctttga tgttaaatta acgtgttcat tcattatgtt ctactgttgt ttctattgtc 1260 

cggaacgacc atagagactg tcgctatgtt aataggaata tttgactggt tatatgcgcc 1320 

aagggttatc gctgcactct ctggggcgat ggtattcatc attacgcaag ataacttcat 1380 

tggtgtcaga cgggtgttat tgttttttgt gtctttttta ctcggtttga cattttcaga 1440 

gacaacagct tccgttatca acttctatat cccgaatgat atacatatag gaaatgacct IS 00 

tggtgccttt gttaccagcg ccgtgacggt gaagcttttt gttatcatta tgagcaagat 1560 

agagagaaaa tatcttggag aataaccgcc atgttccaaa tcatacttct taatgttaat 1620 

gccgtgattt gcttggctat tgccgtcaga ttattcctgt ggcgtatcaa tcataaaatg 1580 

aaaaacattg tcgtctcttt taetgctttt ctcattattei cggcgtgcgg cgctgtctcc 1740 

atcaggacga tgacggggga gtattactat gcggattggt ccgagacgat cattaacctt 1800 

tcgcttttcc tgtctgttta tatacgcaat ggcgaaatcc ttcggtgggg ggagaaaaa 1859 
\ 

atg aag at a agt tec cga ggt ate gca tta ate aaa gag ttc gaa ggt 1907 
Met hys He Ser Ser Arg Gly He Ala Zieu He Lys Glu Phe Glu Gly 
15 10 15 

ctg cgc tta cac get tat cgc tgc gee get gac gtc tgg act gtc ggt 1955 
Leu Arg Leu His Ala Tyr Arg Cys Ala Ala Asp Val Trp Thr Val Gly 
20 25 30 

tat ggc cac acg gca ggg gtt aca aag ggt gac ate ate acg gtc gat 2003 
Tyr Gly His Thr Ala Gly Val Thr Lye Gly Asp He He Thr Val Asp 
35 40 45 

gaa gee cag acg atg ctg aca aac gat att ace gta ttt gaa egg gcg 2051 
Glu Ala Gin Thr Met Leu Thr Asn Asp He Thr Val Phe Glu Arg Ala 
50 55 60 

gtc agt cag gee gtc gcg gtt ect ctg aat cag teg caa ta'c gat gee 2099 
Val Ser Gin Ala Val Ala Val Pro Leu Asn Gin Ser Gin Tyr Asp Ala 
65 70 75 80 

ctg gtt tct ttg gtt ttt aat att ggc cag ggg' aat ttt aaa cgc tct 2147 
Leu Val Ser Leu Val Phe Asn He Gly Gin Gly Asn Phe Lys Arg Ser 
85 90 95 

acc ttg ttg aaa aaa etc aac aaa cag gac tat gtc ggc gcc ggg aac 2195 
Thr Leu Leu Lys Lys Leu Asn Lys Gin Asp Tyr val Gly Ala Gly Asn 
lOO 105 110 

gag ttt tta cgc tgg acc egg gcc aat ggg aag gtc ctt ccc gga ctg 2243 
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Glu Phe Leu Arg Trp Thr Arg Ala Asn Gly Lys Val Leu Pro Gly Leu 
115 120 125 

att cgc cga cgc gaa get gaa egg gtg ttg ttt gag aaa ctg ggt gca 2291 
lie Arg Arg Arg Glu Ala Glu Arg Val Leu Phe Glu Lys Leu Gly Ala 
130 135 140 

taa ccctttgcga cgtacccaca agatgaagat aacaccgcgt actgagcggt 2344 

145 

ggcgcaacaa tgaataaatg actgtgtacg gcctgtcctt cacaacggat gggaccatca 2404 

acgtaa tga atg agg caa gac att atg tat aat att gat gat att ctg 2452 
Met Arg Gin Asp He Met Tyr Asn He Asp Asp tie t«eu 
150 155 

gag aaa gtg aat get cca cga gca cgc ctg tea gaa gaa aac gat aca 2500 
Glu Lys Val Asn Ala Pro Arg Ala Arg Leu Ser Glu Glu Asn Asp Thr 
160 165 170 175 

gcg gtg acg ctg aeg gat tta ttc teg cgt teg ttt cec gag gtc aaa 2548 
Ala Val Thr Leu Thr Asp Leu Phe Ser Arg Ser Phe Pro Glu Val Lys 
180 185 190 

aaa ate act ggc gac age ctg tea tgg gga gag gtc tgc tat ctg tac 2596 
Lys He Thr Gly Asp Ser Leu Ser Trp Gly Glu Val Cys Tyr Leu Tyr 
195 200 205 

agt cag gcg cag cac gaa cag aaa gaa aac egg etc ace gaa tec egt 2644 
Ser Gin Ala Gin His Glu Gin Lys Glu Asn Arg Leu Thr Glu Ser Arg 
210 215 220 

att ctg gee egg gcg aat ecc eta ctg gtg aat gcc gtt cgc ctg gga 2692 
He Leu Ala Arg Ala Asn Pro Leu Leu Val Asn Ala Val Arg Leu Gly 
225^ 230 235 

ata egg cag gca gcc ggc agt cgc age tat gat gac tgg ttt ggc tec 2740 
He Arg Gin Ala Ala Gly Ser Arg Ser Tyr Asp Asp Trp Phe Gly Ser 
240 245 250 255 

cgc gca gac cgt ttc gcc cgc ecc ggc teg gtg gee tec atg ttc tea 2788 
Arg Ala Asp Arg phe Ala Arg Pro Gly Ser Val Ala Ser Met Phe Ser 
260 265 270 

ccg gcg gcg tat ctg ace gag ctg tac cgt gag gcg aag gac ctg cat 2836 
Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp Leu His 
275 280 285 

ccg gac ace teg ctg ttc egg ctg gac ate egg cgt ecc gac ctg gcg 2884 
Pro Asp Thr Ser Leu Phe Arg Leu Asp He Arg Arg Pro Asp Leu Ala 
290 295 300 

gcg ctg gcc ctt age cag aat aat atg gac gac gag etc tec acc ctg 2932 
Ala Leu Ala Leu Ser Gin Asn Asn Met Asp Asp Glu Leu Ser Thr Leu 
305 310 315 

age ctg tec aat gag eta ctg tat cgc ggt ate ggg gca gcg gaa ggg 2980 
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Ser Leu Ser Asn Glu Leu Leu Tyr Arg Gly lie Gly Ala Ala Glu Gly 
320 325 330 335 

ctt gac gac gac age gtc agg gag ctg etc gcc ggg tat cgc ctg ace 3028 
Leu Asp Asp Asp Ser Val Arg Glu Leu Leu Ala Gly Tyr Arg Leu Thr 
340 345 350 

ggc ctg acc ccc tat cac tgg gcg tac gag gcg gcc cgc caa gcc att 3076 
Gly Leu Thr Pro Tyr His Trp Ala Tyr Glu Ala Ala Arg Gin Ala lie 
355 360 365 

ctg gtg cag gac ccg acg ctg atg ggg ttc age cgt aat ccg gat gtg 3124 
Leu Val Gin Asp Pro Thr Leu Met Gly Phe Ser Arg Asn Pro Asp Val 
370 375 3B0 

gcg cag ctt atg gac cct gcc tec atg ctg gcc att gaa gcc gat att 3172 
Ala Gin Leu Met Asp Pro Ala Ser Met Leu Ala He Glu Ala Asp He 
3^5 390 395 

tea ccg gag ctg tat cag ata ctg gcc gaa gaa att acg aca gac agt 3220 
Ser Pro Glu Leu Tyr Gin He Leu Ala . Glu Glu He Thr Thr Asp Ser 
400 405 410 415 

tac gaa gca etc tgg agt aag aat ttt ggt gat atg cct ccc tec tea 3266 
Tyr Glu Ala Leu Trp Ser Lys Asn Phe Gly Asp Met Pro Pro Ser Ser 
420 425 430 

ctg tta tet tat gat gca ctt gca aca ttt tat gat ctt gat tac gat 3316 
Leu Leu Ser Tyr Asp Ala Leu Ala Thr Phe Tyr Asp I«eu Asp Tyr Asp 
435 440 445 

gag eta act teg tta ttg tea tta agg ctg gac ttt tea aat eca aac 3364 
Glu Leu Thr Ser Leu Leu Ser Leu Arg Leu Asp Phe Ser Asn Pro Asn 
450 455 460 

aat gaa tac tac att aat agt caa tta agt gtc gta act ctg aat gaa 3412 
Asn Glu Tyr Tyr He Asn Ser Gin Leu Ser Val Val Thr Leu Asn Glu 
465 470 475 

age act ggt tta ata act ata eat cat tat tta aga acg eta ggc gga 3460 
Ser Thr Gly Leu He Thr He His His Tyr Leu Arg Thr Leu Gly Gly 
480 485 490 495 

gac tea cag cag att aac cct gag ctt ata cct tat ggg gat gga aca 3508 
Asp Ser Gin Gin He Asn Pro Glu Leu He Pro Tyr Gly Asp Gly Thr 
500 505 510 

tat ctt tat aat tte age gtg gtg tea acg ata tea gag gat agt ttc 3556 
Tyr Leu Tyr Asn Phe Ser Val Val Ser Thr He Ser Glu Asp Ser Phe 
515 520 525 

aaa eta ggg teg tta ggt tct aac agt age aat ctt tac tet ggg gat 3604 
Lys Leu Gly Ser Leu Gly Ser Asn Ser Ser Asn Leu Tyr Ser Gly Asp 
530 535 540 

tat cag ctt caa aaa ggg gtt cgc tat age att cct gtt gaa ata gat 3652 
Tyr Gin Leu Gin Lys Gly Val Arg Tyr Ser He Pro Val Glu He Asp 
545 550 555 
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gaa gga^aag tta aat gat ggg ate aca ata gga ttg agt agg aaa ggg 
Glu GXy Lys Leu Asn Asp Gly lie Thr lie Gly Leu Ser Arg Lys Gly 
560 565 570 575 



3700 



ggg gga tat tac tea aca gta aac ttc act ctg att gaa tat gat cct 3748 
Gly Gly Tyr Tyr Ser Thr Val Asn Phe Thr Leu lie Glu Tyr Asp Pro 
560 585 590 

gcg ata ttc att ctt aaa tta aat aaa gtt ate cgc eta tac aag gee 3796 
Ala lie Phe He Leu Lys Leu Asn Lys Val He Arg Leu Tyr Lys Ala 
595 600 605 

acg ggc atg acc acg gcg gaa ata tat caa ate acc aat att ctt aat 3844 
Thr Gly Met Thr Thr Ala Glu He Tyr Gin He Thr Asn He Leu Asn 
610 615 620 

aac ggt etc acc att gac cat gcg gtc ctg agt aaa ate ttc ctg gtc 3892 
Asn Gly Leu Thr He Asp His Ala Val Leu Ser Lys He Phe Leu Val 
625 630 635 

cgt tac ctg atg cgt cac tat cag ctt gat gtg gcc egg tea ctg ata 3940 
Arg Tyr Leu Met Arg His Tyr Gin Leu Asp Val Ala Arg Ser Leu He 
640 645 650 655 

ttg tgc aac gga acc ate agt gac cag gcg ttc age ggc gaa acc ggc 3988 
Leu Cye Asn Gly Thr He Ser Asp Gin Ala Phe Ser Gly Glu Thr Gly 
^ 660 665 670 

ctg ttc acc acg ctg ttc aac acc cca ccg ctg aac ggc cag ctg ttt 4036 
Leu Phe Thr Thr Leu Phe Asn Thr Pro Pro Leu Asn Gly Gin Leu Phe 
675 680 685 

tct gca gat gat acc ccc etc gac tta cgc tct gaa gca ccg gag gat 4084 
Ser Ala Asp Asp Thr Pro Leu Asp Leu Arg Ser Glu Ala Pro Glu Asp 
690 695 700 

get ttc cgt etc age gta ctg aaa cgc gca ttt aac ate age gcc teg 4132 
Ala Phe Arg Leu Ser Val Leu Lys Arg Ala Phe Asn He Ser Ala Ser 
705 710 715 

ggg ctt tec acg etc tgg cag ttg gcc age ggt gac age age get ggg 4160 
Gly Leu Ser Thr Leu Trp Gin Leu Ala Ser Gly Asp Ser Ser Ala Gly 
720 725 730 735 

ttt age tgc tct get gac aat ate gcc gca etc tac ega gtg aaa etc 4226 
Phe Ser Cys. Ser Ala Asp Asn 'He Ala Ala Leu Tyr Arg Val Lys Leu 
740 745 750 

ctg get gac ate cac gac eta tec get ggt gag ctg tea atg ttg ctg 4276 
Leu Ala Asp He His Asp Leu Ser Ala Gly Glu Leu Ser Met Leu Leu 
755 760 765 

tec gtc tec cct ttc age ggg gtg gcc gee ggc teg ctg tec gat aat 4324 
Ser Val Ser Pro Phe Ser Gly Val Ala Ala Gly Ser Leu Ser Asp Asn 
770 775 780 

^ag ctg acg cag ttt ctg tac cag acc acc acc tgg etc acg gag cag 4372 



wo 01/16305 PCT/NZOO/001 74 



Glu Leu Thr Gin Phe Leu Tyr Gin Thr Thr Thr Trp Leu'Thr Glu Gin 
785 790 795 

ggc tgg acg gtc age gat gtg ttc ctg atg ctg acg acg cag tac ggt 4420 
Gly Trp Thr Val Ser Asp Val Phe Leu Met Leu Thr Thr Gin Tyr Gly 
800 805 810 BIS 

acc ctg ctg acc ccc gac att gag aac ctg etc get tec ctg cgc aac 4468 
Thr Leu^Leu Thr Pro Asp He Glu Asn Leu Leu Ala Ser I/eu Arg Asn 
820 825 830 

gga ctg teg ggc cgt gag ctg ttc ccg gaa acg etc ccc ggc gat ggc 4516 
Gly Leu Ser Gly Arg Glu Leu Phe Pro Glu Thr Leu Pro Gly Aap Gly 
835 840 845 

get ccc ttt att gee gee gee atg cag ctg gac gcc acg gat acg gcg 4564 
Ala Pro Phe Zle Ala Ala Ala Net Gin Leu Asp Ala Thr Asp Thr Ala 
850 855 860 

aag gcg atg ctg act tgg gcg gac cag ttg aag cca gag ggg ctg acg 4612 
Lys Ala Met Leu Thr Trp Ala Asp Oln Leu Lys Pro Glu Gly Leu Thr 
865 670 875 

ctg acg gaa ttt att ctt ttg gtg atg aat gcc gcc cca aat gac gag 4660 
Leu thr Glu Phe He Leu lieu Val Met Asn Ala Ala Pro Asn Asp Glu 
880 885 890 895 

cag gcg ggc cag atg gca ggg ttc tge caa gcc ctg tgg caa ctg gca 4708 
Gin Ala Gly Gin Met Ala Gly Phe Cys Gin Ala Leu Trp Oln Leu Ala 
900 905 910 

ctg ate ate cgc age acc ggc etc age acg cgc gag ctg acg ctg ctg 4756 
Leu He He Arg Ser Thr Gly Leu Ser Thr Arg Glu Leu Thr Ifcu Leu 
^ 915 920 925 

gtc age cag ccg gga cgc ttc cgc aca gga tgg cac cat ctg ccc cat 4804 
Val Ser Gin Pro Gly Arg Phe Arg Thr Gly Trp His His Leu Pro His 
930 935 940 

gac ctg ccg gcg ctt cgc gac att acg cgt ttt cat gcc gtc gtt aac 4852 
Asp Leu Pro Ala I«eu Arg Asp Zle Thr Arg Phe His Ala Val Val Asn 
945 950 955 

cgc age ggc age cat gcc ggg gag gtc ctg acc gca ctt gag acc gga 4900 
Arg Ser Qly Ser His Ala Gly Glu Val Leu Thr Ala Leu Glu Thr Gly 
960 965 970 975 

gaa ctg teg tea gcc ctg ctg gcc egg gcc ctg tea cag aat gag cag 4948 
Glu Leu Ser Ser Ala Leu Leu Ala Arg Ala Leu Ser Gin Asn Glu Gin 
980 985 ' 990 

gat gtg acc ggc gcc ttg gcg cag gtg agg ggg gcc ggt gaa cag gac 4996 
Asp Val Thr Gly Ala Leu Ala Gin Val Arg Gly Ala Gly Glu Gin Asp 
995 1000 1005 

aac age gtg ttc ace tec tgg gaa gag gtg gac cag get gag cag tgg 5044 
Asn Ser Val Phe Thr Ser Trp Glu Glu Val Asp Gin Ala Glu Gin Trp 
1010 1015 1020 
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ctg gac atg agt gag acc ctg tec att acg cca tec ggt ctg get age 
Leu Asp Met Ser Glu Thr Leu Ser lie Thr Pro. Ser 61y Leu Ala Ser 

1025 1030 1035 



5092 



ctg att gee ctg aag tac ate aat gtg tec gat gac agt gca ceg ttg 
Leu He Ala Leu Lys Tyr He Asn Val Ser Asp Asp Ser Ala Pro Leu 
1040 1045 1050 1055 



5140 



tac age cag tgg eag gtg gta tec ggt ctg ctg cag gee ggg ctg aaa 
Tyr ser Gin Trp Gin Val val Ser Gly Leu Zjeu Gin Ala Gly Leu Lys 
1060 1065 1070 



5168 



age age eag age teg gcg ctg cac gat tat ctg gag gag ggg acc age 
Ser Ser Gin Ser Ser Ala Leu His Asp Tyr Leu Glu Glu Gly Thr Ser 
1075 1080 1085 



5236 



age gee ctt tgt gcg tat tat ctg cgt aat ctg gca ccg aac atg gta 
Ser Ala Leu Cys Ala Tyr Tyr Leu Arg Asn Leu Ala Pro Asn Net Val 
1090 1095 1100 



5284 



tec ggg cgc gat gac etc ttc ggg tat ctg ctg ctg gat aat cag gtg 
Ser Gly Arg Asp Asp Leu Phe Gly Tyr Leu lieu Leu Asp Asn Gin Val 
1105 1110 1115 



5332 



tea gee aag gta aaa ace acc cgc att gcg gag gee ate gee ggc ata 
Ser Ala Lys Val Lys Thr Thr Arg He Ala Glu Ala Zle Ala Gly He 
1120 1125 1130 1135 



5380 



egg ctg tat ate aac egg gcc ctt aac gga ata gaa etc age gee atg 
Arg Leu Tyr He Asn Arg Ala Leu Asn Gly He Glu Leu Ser Ala Met 
1140 1145 1150 



5428 



gca gag gtg agg ggg cgt cag ttt ttc act gac tgg gat acg ttc aac 
Ala Glu Val Arg Gly Arg Gin Phe Phe Thr Asp Trp Asp Thr Phe Asn 
1155 1160 1165 



5476 



aaa cgt tac age acc tgg gcg ggc gtc tea gag ctg gtt tac ^at ccg 
Lys Arg Tyr Ser Thr Trp Ala Gly Val Ser Glu Leu Val Tyr Tyr Pro 
1170 1175 1180 



5524 



gaa aa^ tac etc gac ccg acg gtc cgt ate ggg cag acc ggc atg atg 
Glu Asn Tyr Leu Asp Pro Thr Val Arg He Gly Gin Thr Gly Met Met 
1185 1190 1195 



5572 



gac acc ctg ctg cag tct gtc age cag age agt ate aac cgc gat acc 
Asp Thr Leu Leu Gin Ser Val Ser Gin Ser Ser He Asn Arg Asp Thr 
1200 1205 1210 1215 



5620 



gtg gag gat gcc ttt aaa acc tat ctg acc acg ttt gag cag att gcc 
Val Glu Asp Ala Phe Lys Thr Tyr Leu Thr Thr Phe Glu Gin He Ala 
1220 1225 1230 



5668 



aat ctg aac act gtc age gga tat cac gat aac gcc age atg acg cag 
Asn Leu Asn Thr Val Ser Gly Tyr His Asp Asn Ala Ser Met Thr Gin 
1235 1240 1245 



S716 



ggg act aca tgg tat gtg ggt cgc age ate aca gat cag act aac tgg 5764 
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Gly Thr Thr Trp Tyr Val Gly Arg Ser He Thr Asp Gin Thr Asn Trp 
1250 X255 1260 



tac tgg cgc age gcc aac cac age aaa ate caa gac tea atg atg ccc 
Tyr Trp Arg Ser Ala Asn His Ser Lys He Gin Asp Ser Met Met Pro 
1265 1270 127S 



5812 



gcg aat gcc tgg acc gga tgg aca aaa att aac tgc gga atg aat ccg 
Ala Asn Ala Trp Thr Gly Trp Thr Lys He Asn Cys Gly Met Asn Pro 
12B0 1285 1290 1295 



5860 



tgg tea gat ctt gtg tgc teg gtg ttt tte aac agt cgc ett tat gtc 
Trp Ser Asp Leu Val Cys Ser Val Phe Phe Asn Ser Ax^ Leu Tyr Val 
1300 1305 1310 



5908 



gtc tgg gtc gaa gag aat cag tot get gat acg gag gca gag age aeg 
Val Trp Val Glu Glu Asn Oln Ser Ala Asp Thr Glu Ala Glu Ser Thr 
1315 1320 1325 



5956 



aca acc acg cag cag age tac acg ctg aaa ctg teg ttc egg cgc tac 
Thr Thr Thr Gin Gin Ser Tyr Thr Leu Lys Leu Ser Phe Arg Arg Tyr 
1330 1335 1340 



6004 



gac ggt aca tgg agt tee ccg gtg teg ttc gac att acc gge aac ate 
Asp GlV Thr Trp Ser Ser Pro Val Ser Phe Asp He Thr Gly Asn He 
1345 1350 1355 



6052 



gca ttt ccg gaa acg cag gge atg cat gtg acc tgt aat ccc ctg act 
Ala Phe Pro Glu Thr Gin Gly Met His Val Thr Cys Asn Pro Leu Thr 
1360 1365 1370 1375 



6100 



gag cag etc tat tgc gcg ttt tac tec gtc acc age aag ccg gac ttt 
Glu Gin Leu Tyr Cys Ala Phe Tyr Ser Val Thr Ser Lys Pro Asp Phe 
1380 1385 1390 



6146 



gat aac get cag ctg att tct gtg gat aat gat atg acg eta aat gtc 
Asp Asn Ala Gin Leu He Ser Val Asp Asn Asp Met Thr Leu Asn Val 
1395 1400 1405 



6196 



ate tea gat ata ggg att ttt aag age gtc agt cac gaa ttt aat acg 
He Ser Asp He Gly He Phe Lys Ser Val Ser His Glu Phe Asn Thr 
1410 1415 1420 



6244 



age act gag aaa ttt att aat aat gtt ttt tea gac cet tec get aat 
Ser Thr Glu Lys Phe He Asn Asn Val Phe Ser Asp Pro Ser Ala Asn 
1425 1430 1435 



6292 



tat ttt gtc agt gca acg agt tta att gat gat gtt ate cac age gat 
Tyr Phe Val Ser Ala Thr Ser Leu He Asp Asp Val He His Ser Asp 
1440 ^ 1445 1450 1455 



6340 



ttc tea etc ctt aat tct aaa act aca agt act gtt ttt act aat gaa 
Phe Ser Leu Leu Asn Ser Lys Thr Thr Ser Thr Val Phe Thr Asn Glu 
1460 1465 1470 



6388 



gat tec tct ctt ttg acg cca gag ctt cat att aca gca aat gtt teg 
Asp Ser Ser heu Leu Thr Pro Glu Leu His He Thr Ala Asn Val Ser 
1475 1480 1485 



6436 
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tgt ttt gtt agt act get ggc ate gcc act caa tct acc ata gaa aaa 
Cys Phe Val Ser Thr Ala Gly lie Ala Thr Gin Ser Thr He Glu Lys 
1490 1495 1500 



£464 



ttc gtt cag gca ggg ata gaa ttt gag gaa atC aat ttt tat gca ggc 
Phe Val\Gln Ala Gly He Glu Phe Glu Glu He Asn Phe Tyr Ala Gly 
1505 1510 1515 



6532 



cag gcc gcc ggc gga ttt gac gga ttt gtg gga gtg gat gtt tct aat 
Gin Ala Ala Gly Gly Phe Asp Gly Phe Val Gly Val Asp Val Ser Asn 
1520 1525 1530 1535 



6580 



tea aaa gta tac cag gtc gga aaa gaa gca gtt ggt gtc act gta aaa 
Ser Lys Val Tyr Gin Val Gly Lys Glu Ala Val Gly Val Thr Val Lys 
1540 1545 1550 



6628 



tct tat tec gtc act ggc gtt agt ggt tct gtt gag tta ttt att gat 
Ser Tyr Ser Val Thr Gly Val Ser Gly Ser Val Glu Leu Phe He Asp 
1555 1560 1565 



6676 



tea tea aat aaa tac ttc age gga att ttg tea gat aaa atg ata acc 
Ser Ser Asn Lys Tyr Phe Ser Gly He Leu Ser Asp Lys Met He Thr 
1570 1575 1580 



6724 



get tta att age ggc agt aca tea aaa gtt aat tac gtg teg tct att 
Ala Leu He Ser Gly Ser Thr Ser Lys Val Asn Tyr Val Ser Ser He 
1585 1590 1595 



6772 



ggc tct caa gat ttt tgg agt gta aag teg etc atg ccg gca ett cag 
Oly Ser Gin Asp Phe Trp Ser Val Lys Ser Leu Met Pro Ala Leu Gin 
1600 1 1605 1610 161S 



6820 



ata tat gaa tta ate gat gat ate ata ctg aca tec ggc gta aat ggg 
He Tyr Glu Leu He Asp Asp He He Leu Thr Ser Gly Val Asn Gly 
1620 1625 1630 



6868 



act gaa att aaa tec tgg ect tec get gaa tgg tat aat gat aag ctg 
Thr Glu He Lys Ser Trp Pro Ser Ala Glu Trp Tyr Asn Asp Lys Leu 
1635 1640 1645 



6916 



agt ctg caa tec ggg aat aat ett ttc aac ace aaa teg ctg agt ttt 
Ser Leu Gin Ser Gly Asn Asn Leu Phe Asn Thr Lys Ser Leu Ser Phe 
1650 1655 1660 



6964 



acc gtt aat acc agt gat att gtt gaa gat gag ttt gac gtg aeg ttt 
Thr Val Asn Thr Ser Asp He Val Glu Asp Glu Phe Asp Val Thr Phe 
1665 1670 1675 



7012 



acg ttc acc get gtc gat cag aat aac gtc gtg ctg gee gee egg acg 
Thr Phe Thr Ala Val Asp Gin Asn Asn Val Val Leu Ala Ala Arg Thr 
1680 1685 1690 1695 



7060 



gcc ata tta acc gtc att cga aac att aat aat gac act tec gtt ate 
Ala He Leu Thr Val He Arg Asn He Asn Asn Asp Thr Ser Val He 
1700 1705 1710 



7108 



gca tta cgt aaa aat acg cgt ggc gcg cag tat att cgt ttc act gcg 



7156 
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Ala Leu Arg Lys Asn Thr Arg Gly Ala Gin Tyr lie Arg Phe Thr Ala 
1715 1720 1725 



ggt aac gat gtg gcg ctt att cgc etc aac acc etc ttt gcc cgc caa 
Gly Asn Asp Val Ala Leu He Arg Leu Asn Thr Leu Phe Ala Arg Gin 
1730 . 1735 1740 



7204 



ctg gtc gac egg gcg aat acc ggg att gac acc att ctt tec atg gag 
Leu Val Asp Arg Ala Asn Thr Gly He Asp Thr He Leu Ser Met Qlu 
1°745 1750 1755 



7252 



acc cag agg ctt acc gaa ccc gcc ctg gaa gag ggg agt gat gtg ttt 
Thr Gin Arg Leu Thr Glu Pro Ala Leu Glu Glu Gly Ser Asp Val Phe 

1760 1765 1770 1775 



7300 



atg gac ttc tec gga gcc aat gcc etc tat ttc tgg gag ctg ttc tat 
Met Asp Phe Ser Gly Ala Asn Ala Leu Tyr Phe Tzp Glu Leu Phe Tyr 
1780 1785 1790 



7348 



tac acg ccg atg atg gtg ttc cag egg ttg ttg cag gaa cag cac ttc 
Tyr Thr Pro Met Met Val Phe Gin Arg Leu Leu Gin Glu Gin His Phe 
1795 1800 1805 



7396 



ccg gaa gcc acc cgc tgg ctg cag tat gtc tgg aac ccg gcc ggg cac 
Pro Glu Ala Thr Arg Trp Leu Gin Tyr Val Tzp Asn Pro Ala Gly His 
1810 1815 1820 



7444 



gtg gta aac ggg gtg ctg cag aat tac acc tgg aat gtc cgt ccg ctg 
Val Val Asn Gly Val Leu Gin Asn Tyr Thr Tzp Asn Val Arg Pro Leu 
1825 1830 1835 



7492 



gag gag 'gac acc ggc tgg aac gac teg ccg ctg gac tec att gac ccc 
Glu Glu Asp Thr Gly Tzp Asn Asp Ser Pro Leu Asp Ser He Asp Pro 
1840 1845 1850 1855 



7540 



gat gca ata gcc cag tac gac ccc atg cat tac aag gtc gcc acc ttt 
Asp Ala He Ala Gin Tyr Asp Pro Net His Tyr Lys Val Ala Thr Phe 
1860 1865 1870 



7S88 



atg tc^ tac etc gac ctg ctg att gcc cgc ggt gat gcc gcc tac egg 
Met Ser Tyr Leu Asp Leu Leu He Ala Arg Gly Asp Ala Ala Tyr Arg 
1875 1880 1885 



7636 



ctg etc gag egg gac acc ctt aac gag gcc egg atg tgg tac gtc cag 
Leu Leu Glu Arg Asp Thr Leu Asn Glu Ala Arg Met Trp Tyr Val Gin 
1890 1895 1900 



7684 



gcc ctg aac ctt ctg ggc gac gag ccc tat att tec ttt gac gcc gac 
Ala Leu Asn Leu Leu Gly Asp Glu Pro Tyr He Ser Phe Asp Ala Asp 
1905 1910 1915 



7732 



tgg teg gcg ttg acc ctg ggt gac gca gcc age gag gtg acg cga cgc 
Trp Ser Ala Leu Thr I*eu Gly Asp Ala Ala Ser Glu Val Thr Arg Arg 
1920 1925 1930 1935 



7780 



gat tac cag gag gcc ctg ctg gcc gtg cgc egg ttg gtg ccc get ccc 
Asp Tyr Gin Glu Ala Leu Leu Ala Val Arg Arg Leu Val Pro Ala Pro 
1940 1945 1950 



7828 
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gag aca egg acg gcg aaC tec ctg acg gca ctg ttc ccc ccg cag cag 
Glu Thr Arg Thr Ala Asn Ser Leu Thr Ala Leu Phe Leu Pro Gin Gin 
19S5 1960 1965 



7876 



aac gag gtg etc aaa ggc tac tgg caa acc ttg gca cag egg etc cat 
Asn Glu Val Leu Lys Gly Tyr Trp Gin Thr Leu Ala Oln Arg Leu His 
1970 1975 1980 



7924 



aac ctg cgc cac aac etc tec att gac ggc cag ccg ctt tec ctg tec 
Asn Leu Arg His Asn Leu Ser lie Asp Gly Gin Pro Leu Ser Leu Ser 
1985 1990 1995 



7972 



gtc tac gee acg ccg tec gaa ccg tec gee ctg cag agt gee gtc gtc 
Val Tyr Ala Thr Pro Ser Glu Pro Ser Ala Leu Gin Ser Ala Val Val 
2000 2005 2010 2015 



8020 



aac age gcg cag ggt get gca gca ctg ccg gee gcg gtg atg ccg ctt 
Asn Ser Ala Gin Gly Ala Ala Ala Leu Pro Ala Ala Val Met Pro Leu 
2020 2025 2030 



8068 



tac agt ttc ccg gtc atg ctg gag aac gcc egg ggg atg gtg age ctg 
Tyr Ser Vhe Pro Val Met Leu Glu Asn Ala Arg Gly Met Val Ser Leu 
2035 2040 2045 



8116 



Ctg acc ggg ttc ggc aac aca ctg etc ggt att acc gag cgt cag gat 
Leu Thr Gly Phe Gly Afin Thr Leu Leu Gly tie Thr Glu Arg Gin Asp 
2050 2055 2060 



8164 



gcg gag gcg ctg gcc aaa ctg ctg cag acc cag ggc agt gaa ctg ata 
Ala Glu Ala Zieu Ala Lys Leu Leu Gin Thr Gin Gly Ser Glu I<eu lie 
2065 2070 2075 



8212 



cgc cag ggc ctt cgc cag cag gat aac gtc etc gag gaa ate gat gcg 
Arg Gin Gly Leu Arg Gin Oln Asp Asn Val Leu Glu Glu lie Asp Ala 
2080 2085 2090 2095 



8260 



gat att gcc gcc ctg gag gag age cgc cgc ggc gcg cag atg cgt ttt 
Asp He Ala Ala Leu Glu Glu Ser Arg Arg Gly Ala Gin Met Arg Phe 
2100 2105 2110 



8308 



gaa cgt tac aaa gtg ttg tac gag gcg gac gtc aac acc ggc gaa aaa 
Glu Arg Tyr Lys Val Leu Tyr Glu Ala Asp Val Asn Thr Gly Glu Lys 
2115 2120 2125 



8356 



cag gcc atg gac ttg tac etc agt teg tec gtg ctg teg gca tea acc 
Gin Ala Met Asp Leu Tyr Leu Ser Ser Ser Val Leu Ser Ala Ser Thr 
^130 2135 2140 



8404 



gcc gcg etc ttt ttg gcc gag gcc gcg gcc gat atg ctg ccc aat att 
Ala Ala Leu Phe Leu Ala Glu Ala Ala Ala Asp Met Leu Pro Asn He 
2145 2150 2155 



8452 



tac ggg ctg gcc gtc ggg ggc tec cgc tat ggg gca eta ttt aaa gcc 
Tyr Gly Leu Ala Val Gly Gly Ser Arg Tyr Gly Ala Leu Phe Lys Ala 
2160 2165 2170 2175 



8500 



acc gcc ate ggc ate cag gtg tec tec gat gee acc cgc ata tea gcg 



8548 
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thr Ala lie Gly lie Gin Val Ser Ser Asp Ala Thr Arg He Ser Ala 
2180 2185 2190 



gac aaa ate age cag teg gaa gtg tac cgc egt cgc egg gag gag tgg 
Asp Lys He Ser Gin Ser Glu Val Tyr Arg Arg Arg Arg Glu Glu Trp 
2195 2200 220S 



8596 



gaa ate «ag egt gat agt gcg cag tct gac gtg gcg cag att gat gee 
Glu He Gin Arg Asp Ser Ala Gin Ser Asp Val Ala Gin He Asp Ala 
2210 2215 2220 



8644 



cag ctg gcg gee atg gca gtg cgc egg gaa ggg get gag etg cag aaa 
Gin Leu Ala Ala Met Ala Val Arg Arg Glu Gly Ala Glu Leu Gin Lys 
2225 2230 2235 



8692 



act tac ctt gag acc cag cag acc cag gca cag gcg cag ttg gca ttc 
Thr Tyr Leu Glu Thr Gin Gin Thr Gin Ala Gin Ala Gin Leu Ala Phe 
2240 2245 2250 2255 



8740 



ctg cag agt aag ttc aac aat acg get ctg tac age tgg ctg egg ggc 
Leu Gin Ser Lys Phe Asn Asn Thr Ala Leu Tyr Ser Trp Leu Arg Gly 
2260 2265 2270 



6788 



agg ttg tee gee att tat tac cag ttc tat gac ctg gca gta tec cgc 
Arg Leu Ser Ala He Tyr Tyr Gin Phe Tyr Asp Leu Ala Val Ser Arg 
2275 2280 2285 



8836 



tgc ctg atg gcg caa cag gee tgg cag tgg gat aaa ttc gag act agg 
CyB Leu Met Ala Gin Gin Ala Trp Gin Tzp Asp Lys Phe Olu Thr Arg 
2290 2295 2300 



8884 



teg ttt ate cag ccg ggg gee tgg atg ggg gca aat gee ggt ctg ctg 
Ser Phe^Ile Gin Pro Gly Ala Trp Met Gly Ala Asn Ala Gly Leu Leu 
2305 2310 2315 



8932 



gee ggg gaa acc ctg atg ctg aat ctg gcg cag atg gag cag gee tgg 
Ala Gly Glu rthr Leu Met Leu Asn Leu Ala Gin Net Glu Gin Ala Trp 
2320 2325 2330 2335 



8980 



ctg acg ggg gat gag egg gca ata gag gtg acg egg acg gtc tgc ctg 
Leu Thr Gly Asp Glu Arg Ala He Glu Val Thr Arg Thr Val Cys Leu 
2340 2345 2350 



9028 



teg gag gtc tat acc age etc gcg gag gat gcg gca ttc tct ctg gee 
Ser Glu Val Tyr Thr Ser Leu Ala Glu Asp Ala Ala Phe Ser Leu Ala 
2355 2360 2365 



9076 



gac aag gtg gtg gaa ctg gtc agt aac ggt teg ggc agt gcg ggt acg 
Asp Lys Val Val Glu Leu Val Ser Asn Gly Ser Gly Ser Ala Gly Thr 
2370 2375 2380 



9124 



aaa age aac gga tta cag atg gat caa cag caa etc gag gee acc etg 
Lys Ser Asn Gly Leu Gin Met Asp Gin Gin Gin Leu Glu Ala Thr Leu 
2385 2390 2395 



9172 



aaa ctg get gac etc ggt ate ggc aac gat tac ccg gtc tec ctt ggc 
Lys Leu Ala Asp Leu Gly He Gly Asn Asp Tyr Pro Val Ser Leu Gly 
2400 240S 2410 2415 
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acc atg agg cgc ate aaa caa ata age gtc acg etc ccg gcg ctg gtc 
Thr Met Arg Arg He Lys Gin He Ser Val Thr Leu Pro Ala Leu Val 
2420 2425 2430 



9268 



ggc ccc tat cag gac gtc cgt gcg gtt etc age tac ggc gga agt atg 
Gly Pro Tyr Gin Asp Val Arg Ala val Leu ser Tyr Gly Oly Ser Met 
2435 2440 2445 



9316 



gtc atg ccc egg ggt tgc age gcg ctg gcg gtc tea cac gga atg aac 
Val Met Pro Arg Gly Cys Ser Ala Leu Ala Val Ser Hie Gly Met Asn 
2450 2455 2460 



9364 



gac age ggc caa ttc caa ctg gat tte aat gac ccg cgt tac ctg ccg 9412 



Asp Ser Gly Gin Phe Gin Leu Asp Phe Asn Asp Pro Arg Tyr Leu Pro 
2465^ 



2470 



2475 



ttt gaa gga ctt cca gtt gat gac aca ggg acc ctg aca ctg age ttc 
Phe Glu Gly Leu Pro Val Asp Asp Thr Gly Thr Leu Thr Leu Ser Phe 
2480 2485 2490 2495 



9460 



ccg gat get gac ggc aaa caa cag gcg atg etc etc agt ctg age gac 
Pro Asp Ala Asp Gly Lys Gin Gin Ala Met Leu Leu Ser Leu Ser Asp 
2500 2505 2510 



9508 



ate ate ctg cat ate cgt tac acc att ate age tga tag gtatcaacat 
He He Leu His He Arg Tyr Thr He He Ser 
2515 2520 



9557 



agcgcaggce cccgaacgag ggcctgegag gagactgagc atg caa aat cat caa 

. Net Gin Asn His Gin 
2525 



9612 



gac atg gee att act gcc ccc acg ttg cet tec ggg ggc ggt gcg gtc 
Asp Met Ala He Thr Ala Pro Thr Leu Pro Ser Gly Oly Gly Ala Val 
2530 2535 2540 2545 



9660 



acc ggg etc aag ggt gat ate gcg gcg gca ggg ccg gat ggt gcg gcg 
Thr Gly Leu Lye Gly Asp He Ala Ala Ala Gly Pro Asp Gly Ala Ala 
2550 2555 2560 



9708 



acc ctg agt att ccc ttg ccg gtt age ccc ggt egg ggt tac gcc ccc 
Thr Leu Ser He Pro Leu Pro Val Ser Pro Oly Arg Gly Tyr Ala Pro 
2565 2570 2575 



9756 



act ggg gca ctt aat tat cac age egg teg ggg aac ggc ccc ttt ggc 
Thr Gly Ala Leu Asn Tyr His Ser Arg Ser Gly Asn Gly Pro Phe Gly 
2580 2585 2590 



9804 



att ggc tgg ggt ate ggc ggt get get gtc cag cgt cgt acg cgc aac 
He Gly Trp Gly He Gly Oly Ala Ala Val Gin Arg Arg Thr Arg Asn 
2595 2600 2605 



9852 



gga gca cet acc tac gat gat act gat gaa tte acc ggt ccg gac ggt 
Gly Ala Pro Thr Tyr Asp Asp Thr Asp Glu Phe Thr Oly Pro Asp Gly 
2610 2615 2620 2625 



9900 



gag gtg ctg gtg ccg gca etc acg get get ggc acc caa gaa gca egg 



9948 
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Glu Val Leu Val Pro Ala Leu Thr Ala Ala Gly Thr Gin Glu Ala Arg 
2630 2635 2640 



cag gcc acc tea eta ctg ggg ata aac cca ggc gga age ttc aac gtt 
Gin Ala Thr Ser Leu Leu Gly lie Asn Pro Gly Oly Ser Phe Asn Val 
2645 2650 2655 



9996 



cag gtt tac cgt tea cgt acg gag ggt agt etc age ege ett gag cgt 
Gin Val Tyr Arg Ser Arg Thr Glu Gly Ser Leu Ser Arg Leu Glu Arg 
2660 2665 2670 



10044 



tgg Ctg cec gee gac gag aea gaa acg gaa ttt tgg gtg tta tat acc 
Trp Leu Pro Ala Asp Glu Thr Glu Thr Glu Phe Trp Val Leu Tyr Thr 
2675 2680 2685 



10092 



cct gac gga cag gtg get ctg ctg ggc cga aat gcg cag get cgc ate 
Pro Asp Gly Gin Val Ala Leu Leu Gly Arg Asn Ala Gin Ala Arg He 
2690 2695 2700 2705 



10140 



age aac ccc aca gcc cca aca cag acg gcg gtt tgg ctg atg gag tec 
Ser Asn Pro Thr Ala Pro Thr Oln Thr Ala Val Trp Leu Met Glu Ser 
2710 2715 2720 



10188 



teg gta tea ett acc ggc gaa cag atg tat tac caa tac cgt gcg gaa 
Ser Val Ser Leu Thr Gly Glu Gin Met Tyr Tyr Gin Tyr Arg Ala Glu 
2725 2730 2735 



10236 



gat gat gac ggt tgt gac gag gcg gag ege gac gcg cae ccg cag gcc 
Asp Asp Asp Gly Cys Asp Glu Ala Glu Arg Asp Ala His Pro Gin Ala 
2740 274S 2750 



10284 



ggc gee caa cgt tat ccg gtg gcg gtc tgg tat ggt aac cgt cag gcg 
Gly Ala Gin Arg Tyr Pro Val Ala Val Trp Tyr Gly Asn Arg Gin Ala 
2755 2760 2765 



10332 



get egg acg eta ccg gcg ctg gtg teg aca cca tea atg gat age tgg 
Ala Arg Thr Leu Pro Ala Leu Val Ser Thr Pro Ser Met Asp Ser Trp 
2770 2775 2780 2785 



10380 



Ctg ttt ate ctg gtg ttt gat tat ggt gag cgt age teg gtg ctg tet 
Leu Phe lie Leu Val Phe Asp Tyr Gly Glu Arg Ser Ser Val Leu Ser 
2790 2795 2800 



10426 



gaa gcg ccg gcc tgg caa aca cca gga agt ggg gag tgg ctg tgt cgt 
Glu Ala Pro Ala Trp Oln Thr Pro Gly Ser Oly Glu Trp Leu Cys Arg 
2805 2810 2615 



10476 



cag gat tgt ttt tec ggg tat gag ttt ggt ttt aac ctg egg act cgc 
Gin Asp Cys Phe Ser Gly Tyr Glu Phe Oly Phe Asn Leu Arg Thr Arg 
2820 2825 2830 



10524 



cgc ctg tgc cgt cag gtt ttg atg ttc cat tac eta ggt gtt ctg gcg 
Arg Leu Cys Arg Gin Val Leu Met Phe His Tyr Leu Gly Val Leu Ala 
2835 2840 2845 



10572 



ggg agt teg gga gcg aat gat gcg cca gca ttg att tct cgc ctg ttg 
Gly Ser Ser Gly Ala Asn Asp Ala Pro Ala Leu He Ser Arg Leu Leu 
2850 2855 2860 2865 
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ctg gac tac agg gaa agt cct tea etc agt ctg etc gag aac gtg cac 
Leu Asp Tyr Arg Glu Ser Pro Ser Leu Ser Leu Leu Glu Asn Val His 
2670 2B75 28B0 



10668 



eag gtg get tat gag teg gac ggg acg tct tgt gee ttg ceg gca ctg 
Gin Val^ Ala Tyr Glu Ser Asp 61y Thr Ser Cys Ala Leu Pro Ala Leu 
2885 2890 2895 



10716 



gca ttg ggg tgg caa acc ttt acc ceg ceg aca ttg teg gca tgg cag 
Ala Leu Gly Trp Gin Thr Phe Thr Pro Pro Thr Leu Ser Ala Trp Gin 
2900 2905 2910 



10764 



acg cgt gac gat atg ggc aag ttg agt ttg ctt caa ecc tat cag ctt 
Thr Arg Asp Asp Met Gly Lye Leu Ser Leu Leu Gin Pro Tyr Qln Leu 
2915 2920 2925 



10812 



gta gac ctt aac ggc gaa ggt gtg gtg ggt ate ctg tat cag gac age 
Val Asp Leu Asn Gly Glu Oly Val Val Gly lie Leu Tyr Gin Asp Ser 
2930 2935 2940 2945 



10B60 



ggt gcc tgg tgg tac cgt gaa ceg gta cgc cag teg ggg gat gat ceg 
Gly Ala Trp Trp Tyr Arg Glu Pro Val Arg Gin Ser Gly Asp Asp Pro 
2950 2955 2960 



10908 



gat get gtg acc tgg ggg gcg get gcg gcc ctg ceg aca atg ecc get 
Asp Ala Val Thr Trp Oly Ala Ala Ala Ala Leu Pro Thr Met Pro Ala 
2965 2970 2975 



10956 



ttg cat aac age ggc ate ctg gcg gat ctt aat ggg gat ggt egg ctg 
Leu His Asn Ser Gly lie Leu Ala Asp Leu Asn Gly Asp Gly Arg Leu 
2980 2985 2990 



11004 



gag tgg gte gtt acc gcc ecc ggt gtg gcg ggg atg tat gat cgc acc 
Glu Trp val Val Thr Ala Pro Gly Val Ala Oly Met Tyr Asp Arg Thr 
2995 3000 3005 



11052 



ecc ggc cgc gac tgg ttg cat ttc acc ecc ctg tea gcc ttg ecc gta 
Pro Gly Arg Asp Trp Leu Bis Phe Thr Pro Leu Ser Ala I«eu Pro Val 
3010 3015 3020 3025 



11100 



gaa tat gcg cat cea aaa gca gtg etc gcc gat ate ctg ggg get ggg 
Glu Tyr Ala His Pro Lys Ala Val Leu Ala Asp He Leu Oly Ala Oly 
3030 3035 3040 



11148 



tta acg gac atg gtg ctt ate ggg ceg cgc agt gtt cgc etc tat tee 
Leu Thr Asp Met Val Leu He Oly Pro Arg Ser Val Arg Leu Tyr Ser 
3045 3050 3055 



11196 



ggc aaa aac gat ggt tgg aat aaa ggg gag acc gtg cag caa acg gaa 
Gly Lys Asn Asp Gly Trp Asn Lys Oly Glu Thr Val Gin Gin Thr Glu 
3060 3065 3070 



11244 



aga etc act ctg ceg gtc ceg ggg gtt gac eca cgt ace etc gtg gcg 
Arg Leu Thr Leu Pro Val Pro Gly Val Asp Pro Arg. Thr Leu Val Ala 
3075 3080 3085 



11292 



ttc a^t gat atg get ggc agt gga cag cag cat ttg acg gag gtg cgt 1134 0 
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Phe Ser Asp Met Ala Gly Ser Gly Gin Gin His Leu Thr Glu Val Arg 
3090 3095 3100 3105 



get aat gga gta cgt tac tgg cca aac ccg ggg cac ggt cgt ttc ggt 
Ala Asn Gly Val Arg Tyr Trp Pro Asn Leu Gly His Gly Arg Phe Gly 
3110 3115 3120 



11388 



cag ccg gtg aat att ccc ggt ttt age cag tea gtg act acg ttt aac 
Gin Pro Val Asn lie Pro Gly Phe Ser Gin Ser Val Thr Thr Phe Aen 
3125 3130 3135 



11436 



cct gac cag ata ttg ctg gcc gat ace gac ggt tec ggt acc acg gae 
Pro Asp Gin lie Leu Leu Ala Asp Thr Asp Gly Ser Gly Thr Thr Asp 
3140 3145 3150 



114B4 



ctg att tat gcg atg agt gac egg tta gtc att tat ttc aac cag agt 
Leu lie Tyr Ala Met Ser Asp Arg Leu Val lie Tyr Phe Asn Gin Ser 
3155 3160 3165 



11532 



ggt aat tat ttc gcc gag ccg cat acg ctg etc ttg ccg aaa ggt gtg 
Gly Asn Tyr Phe Ala Glu Pro His Thr Leu Leu Leu Pro Lys Gly Val 
3170 3175 3180 3185 



11580 



cgc tat gat cgc acc tgc agt ctg caa gtg gcg gat ate cag ggg ctg 
Arg Tyr ^ Asp Arg Thr Cys Ser Leu Gin Val Ala Asp lie Gin Gly Leu 
3190 3195 3200 



11628 



ggg gtg cct age ctg tta ctg acg gtc ccc cat gtc gcg cct cat cac 
Gly Val Pro Ser Leu Leu Leu Thr val Pro His Val Ala Pro His His 
3205 3210 3215 



11676 



tgg gtg tgc cat tta teg gca gac aaa ccc tgg ttg ttg aat ggc atg 
Trp Val Cys His Leu Ser Ala Asp Lys Pro Trp Leu IfCu Asn Gly Met 
3220 3225 3230 



11724. 



aac aac aat: atg ggg gcc egg cat gca ctg cac tat cgc agt teg gtg 
Asn Asn Asn Met Gly Ala Arg His Ala Leu His .Tyr Arg Ser Ser Val 
3235 3240 3245 



11772 



cag ttc tgg ctg gat gag aaa gcc gag gca ctg gcg gca ggc agt tec 
Oln Phe Trp I«eu Asp Glu Lys Ala Glu Ala Leu Ala Ala Gly Ser Ser 
3250 3255 3260 3265 



11820 



cct gcc tgc tac ctg cca ttt aca ttg cat acc ctg tgg cgt teg gtg 
Pro Ala Cys Tyr Leu Pro Phe Thr Leu Ris Thr Leu Trp Arg Ser Val 
3270 3275 3280 



11868 



gtg cag gat gag ate acc ggt aac cgt ctg gtc age gac gtg ctt tat 
Val Gin Asp Glu lie Thr Gly Asn Arg Leu Val Ser Asp Val Leu Tyr 
3285 3290 3295 



11916 



cgc cac ggc gtc tgg gac ggg cag gaa cgc gag ttt egg ggg ttt ggt 
Arg His Gly Val Trp Asp Gly Gin Glu Arg Glu Phe Arg Gly Phe Gly 
3300 3305 3310 



11964 



ttt gtt gag ate agg gat ace gat acc ttg gca age cag ggt acg gcg 
Phe Val Glu He Arg Asp Thr Asp Thr Leu Ala Ser Gin Gly Thr Ala 
3315 3320 3325 



12012 
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acg gaa ctg agt atg cct tct gtg age egg aac tgg tat gcc acc ggg 
Thr Glu Leu Ser Met Pro Ser Val Ser Arg Asn Trp Tyr Ala Thr Gly 
3330 333S 3340 3345 



12060 



gta ccg gca gta gac gag cgt ctg ccg gag acg tat tgg caa aac gat 
Val Pro Ala Val Asp Glu Arg Leu Pro Glu Thr Tyr Trp Gin Asn Asp 
3350 3355 3360 



12108 



gcc gcc get ttt gcc gat ttc gcg acc cgt ttc act gtc ggt tea gga 
Ala Ala Ala Phe Ala Asp Phe Ala Thr Arg Phe Thr Val Gly Ser Gly 
3365 3370 3375 



12156 



gag gat ^gag cag aca tat act ccg gac gac age aag aca ttc tgg ttg 
Glu Asp Glu Gin Thr Tyr Thr Pro Asp Asp Ser Lys Thr Phe Trp Leu' 
3380 3385 3390 



12204 



cag cga gcc ctg aaa ggc ate ctg ctg cgc agt gag tta tac ggt gcc 
Gin Arg Ala Leu Lye Gly He lieu Leu Arg Ser Glu I«eu Tyr Gly Ala 
3395 3400 3405 



12252 



gat ggc age age cag gcc gat ate cct tac age gtc act gag tct cgc 
Asp Gly Ser Ser Gin Ala Asp He Pro Tyr Ser Val Thr Glu Ser Arg 
3410 3415 3420 3425 



12300 



ccg cag gta egg eta gtt gaa gcg aat gga gac tac ccg gtg gtg tgg 
Pro Gin Val Arg Leu Val Glu Ala Asn Gly Asp Tyr Pro Val Val Trp 
3430 3435 3440 



12348 



ccg atg ggc gcg gaa age cgt acg tea gtt tat gaa egg tac cae aat 
Pro Met Gly Ala Glu Ser Arg Thr Ser Val Tyr Glu Arg Tyr His Asn 
3445 3450 3455 



12396 



gat cct caa tgc caa cag cag gcg gta etc etc agt gat gaa tac ggt 
Asp Pro Gin Cys Gin Gin Gin Ala Val Leu Leu Ser Asp Glu Tyr Gly 
3460 3465 3470 



12444 



ttc cca ctg cgt cag gtc agt gtc aat tat cca cga cgc cct ccg teg 
Phe Pro^ Leu Arg Gin Val Ser Val Asn Tyr Pro Arg Arg Pro Pro Ser 
3475 3480 3465 



12492 



gcg gac aat cca tat ccg gcg tec tta ccg gcg acg ctg ttc gcc aac 
Ala Asp Asn P3CO Tyr Pro Ala Ser Leu Pro Ala Thr Leu Phe Ala Asn 
3490 3495 3500 3505 



12540 



agt tat gac gag cag cag cag ata tta cgc ctg ggg ttg caa cag age 
Ser Tyr Asp Glu Gin Gin Gin He Leu Arg Leu Gly Leu Gin Gin Ser 
3510 3515 3520 



12588 



agt gca cat cac ctt gtt tea ctg tct gag ggg cat tgg ttg ttg ggg 
Ser Ala His His Leu Val Ser Leu Ser Glu Gly His Trp Leu I«eu Gly 
3525 3530 3535 



12636 



ttg gcg gag gcg teg egg gac gat gta ttc acg tac tct gcg gac aac 
Leu Ala Glu Ala Ser Arg Asp Asp Val Phe Thr Tyr Ser Ala Asp Asn 
3540 3545 3550 



12684 



gtg ccg gaa ggg ggt ctg acg ctg gaa cac ctg ttg gcg ccc gaa age 12732 
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Val Pro Glu Gly Gly Leu Thr Leu Glu His Leu Leu Ala Pro Qlu Ser 
3555 3560 3565 



ctg gtc teg gat agt cag gtc ggt acg ctg gcg ggt cag cag caa gtc 
Leu Val Ser Asp Ser Gin Val Gly Thr Leu Ala Gly Gin Gin Gin Val 
3570 3575 3580 3585 



12780 



tgg tat ctg gat tea caa gac gtt gcc acc gtc get get ccg cca etc 
Trp Tyr I-eu Asp Ser Gin Asp Val Ala Thr Val Ala Ala Pro Pro Leu 
3590 3595 3600 



12828 



ccc ccc aag gta get ttt ate gaa acg gcc gtg ctg gat gag ggt atg 
pro Pro Lys Val Ala Phe lie Glu Thr Ala Val Leu Asp Glu Gly Met 
360S 3610 3615 



12876 



gtc agt tea ctg get gcc tac att gtg gat gaa cat etc gag caa gcc 
Val Ser Ser Leu Ala Ala Tyr lie Val Asp Glu His I^u Glu Oln Ala 
3620 3625 3630 



12924 



ggt tac egg caa tec gga tac ett ttc cct cga ggc agg gaa gca gaa 
Gly Tyr^ Arg Gin Ser Gly Tyr Leu Phe Pro Arg Gly Arg Glu Ala Glu 
3635 3640 3645 



12972 



cag gca ttg tgg ace cag tgt cag gga tat gtt acc tat gcc ggc gca 
Gin Ala Leu Trp Thr Gin Cys Oln Gly Tyr Val Thr Tyr Ala Gly Ala 
3650 3655 3660 3665 



13020 



gag eat ttc tgg eta ccg eta tec ttt egg gac agt atg ttg ace ggc 
Glu His Phe Trp Leu Pro Leu Ser Phe Arg Asp Ser Met Leu Thr Gly 
3670 3675 3680 



13068 



cca gtt acc gtg acg cgt gac gcg tac gac tgc gtc ate acg cag tgg 
Pro Val Thr Val Thr Arg Asp Ala Tyr Asp Cys Val lie Thr Gin Trp 
3685 3690 3695 



13116 



cag gat gee gca ggg att gtc acc aca gcc gac tat gac tgg cgc ttc 
Oln Asp Ala Ala Gly lie Val Thr Thr Ala Asp Tyr Asp Trp Arg Phe 
3700 3705 3710 



13164 



Ctg acg ccc gtc egg gtg acg gac ccc aat gat aat ctg cag tec gtc 
Leu Thr Pro Val Arg Val Thr Asp Pro Asn Asp Asn Leu Gla Ser Val 
3715 3720 3725 



13212 



act ctg gat get ctg ggc egg gtg acc acc ctg cga ttc tgg ggc acg 
Thr Leu Asp Ala Leu Gly Arg Val Thr Thr Leu Arg Phe Trp Oly Thr 
3730 , 3735 3740 3745 



13260 



gag aat ggt att gcc acc ggt tac agt gat gcc acg ttg tec gtt ccg 
Glu Asn Gly lie Ala Thr Gly Tyr Ser Asp Ala Thr l«eu Ser Val Pro 
3750 3755 3760 



13308 



gac ggc gca gca gcc get ctg gcg ttg acg gcg ccc eta cca gta gca 
Asp Gly Ala Ala Ala Ala Leu Ala Leu Thr Ala Pro Leu Pro val Ala 
3765 3770 3775 



13356 



cag tgt ctg gtg tat gtc acg gac agt tgg gga gat gac gac aat gag 
Gin Cys Leu Val Tyr Val Thr Asp Ser Trp Gly Asp Asp Asp Asn Glu 
3780 3785 3790 



13404 
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aaa atg ccc ccg cac gtg gtc gtg ctg get acc gat cgc tat gac agt 13452 
Uya Met Pro Pro His Val Val Val Leu Ala Thr Aep Arg Tyr Asp Ser 
3795 3800 3805 

gat acc gga cag cag gtc cgc caa cag gtg aca ttc agt gac ggt ttt 13500 
Asp Thr Gly Gin Gin Val Arg Gin Gin Val Thr Phe Ser Asp Gly Phe 
3810 3815 3820 3825 

999 cgt gag ttg caa teg gca acc egg cag gcc gag ggc aac gee tgg 1354 B 
Gly Arg Glu Leu Qln Ser Ala Thr Arg Gin Ala Glu Gly Asn Ala Trp 
3830 . 3835 3840 

caa cga gga cgc gac ggc aaa ctg gtg acg gcc agt gac gga ttg ccg 13596 
Qln Arg Gly Arg Asp Gly Lye Leu Val Thr Ala Ser Asp Gly Leu Pro 
3845 3850 3855 

gtc act gta gca acg aat ttc cgc tgg gcg gtc acc ggg agg gcg gag 13644 
Val Thr Val Ala Thr Asn Phe Arg Trp Ala Val Thr Oly Arg Ala Glu 
3860 3865 3870 

tat gac aat aaa ggt ctg cct gtt egg gtt tat cag ccg tat ttt ctg 13692 
Tyr Asp Asn Lys Gly Leu Pro Val Arg Val Tyr Qln Pro Tyr Phe Leu 
3875 3880 3685 

gac agt tgg caa tat gtc agt gat gac agt gcc cgc cag gac ctg tat 13740 
Asp Ser Trp Gin Tyr Val Ser Asp Asp Ser Ala Arg Gin Asp Leu Tyr 
3890 V 3895 3900 3905 

gcc gac acg cac ttt tac gat ccg acg gca egg gaa tgg cag gtt att 13788 
Ala Asp Thr His Phe Tyr Asp Pro Thr Ala Arg Glu Trp Gin Val lie 
3910 3915 3920 

acg gca aaa ggt gaa egg cga cag gtg ctg tat acc ccg tgg ttt gtg 13836 
Thr Ala Lys Gly Glu Arg Arg Gin Val Leu Tyr Thr Pro Trp Phe Val 
3925 3930 3935 

gtc agt gaa gac gag aat gat acc gtt ggg eta aac gac gca tec tga 13884 
Val Ser Glu Asp Glu Asn Asp Thr Val Gly lieu Asn Asp Ala Ser 
3940 3945 3950 

ctgggaagga gggggggacg gtg atg agt ccg teg ccc ctg aca ggc get gcc 13937 

Met Ser Pro Ser Pro Leu Thr Gly Ala Ala 

3955 3960 

ctg atg gag aca aag atg aaa ata cac tat cag gtt gcg gcg gtt gtg 13985 
Leu Met Glu Thr Lys Met Lys lie His Tyr Gin Val Ala Ala Val Val 
3965 3970 3975 

Ctg aca ggt gtt atg gtt tgg ggg ctt tec cat tgg cgt tac acc gtc 14033 
Leu Thr Gly Val Met Val Trp Gly Leu Ser His Trp Arg Tyr Thr Val 
3980 3985 3990 3995 

ggt tac cac gcg gca gat act caa tgg caa caa cgc cag gcc gaa cag 14081 
Gly Tyr Hie Ala Ala Asp Thr Gin Trp Gin Gin Arg Gin Ala Glu Gin 
4000 4005 4010 

gaa agg gcc gat gcg ttg gcc etc ctg gca gca gaa acc egg gaa aga 14129 



• 
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Glu Arg Ala Asp Ala Leu Ala Leu Leu Ala Ala Glu Thr Arg Glu Arg 
4015 4020 4025 



aag tgg gag cag caa cga cag act gac atg aac aag gtg get ata cat 
Lys Trp Glu Gin Gin Arg Gin Thr Asp Met Asn Lys Val Ala He His 
4030 4035 4040 



14177 



get gaa gaa gaa ctg get get gcg egt gac get gee get gat get cag 
Ala Glu Glu Glu Leu Ala Ala Ala Arg Asp Ala Ala Ala Asp Ala Gin 
4045 4050 4055 



14225 



cgc act ggt cag cge ctg cag cac ace gtt ace ace etc cag egg caa 
Arg Thr Gly Gin Arg Leu Gin His Thr Val Thr Thr Leu Gin Arg Gin 
4060 4065 4070 4075 



14273 



ctt gcc agt egt gaa ace cgc cgc ctt tec gca get acc get ate ggt 
Leu Ala Ser Arg Glu Thr Arg Arg Leu Ser Ala Ala Thr Ala lie Gly 
4080 4085 4090 



14321 



aca gac gac etc gga ggc caa ccc ggc gtt ttg ttt gcc gaa ctg ttc 
Thr Asp Asp Leu Gly Gly Gin Pro Gly Val Leu Phe Ala Glu Leu Phe 
4095 4100 4105 



14369 



cgc cgc get gac cag aga gcg gga ga^ ctg gca gcg tat get gac agg 
Arg Arg Ala Asp Gin Arg Ala Gly Glii Leu Ala Ala Tyr Ala Asp Arg 
4110 4115 4120 



14417 



acc aga gtg aaa tgg cag gcc tgc ggg cgc gcc tat cag gcg get acg 
Thr Arg Val Lys Trp Gin Ala Cys Gly Arg Ala Tyr Gin Ala Ala Thr 
4125 4130 4135 



14465 



cac gaa gca gaa aaa taa ggcgatttag ccgttaagga aaagtgacgg 
His Glu Ala Glu Lys 
4140 4145 



14513 



tgttttcgcg attaatatta acaggagatc ac atg age aca tec ttg ttc agt 

Met Ser Thr Ser Leu Phe Ser 
4150 



14566 



age acc ccg teg gtc gcg gtg etc gac aac cgc ggc ctg ttg gtg egg 
Ser Thr Pro ser Val Ala Val Leu Asp Asn Arg Gly Leu Leu Val Arg 
4155 4160 4165 



14614 



gag ctg cag tac tac cgc cat ccg gat aca ccg gag gag acg gac gag 
Glu Leu Gin Tyr Tyr Arg His Pro Asp Thr Pro Glu Glu Thr Asp Glu 
4170 4175 4180 



14662 



egt ate ace tgc cat cag cac gat gag cgc ggc age ttg tea caa age 
Arg He Thr Cys His Gin His Asp Glu Arg Gly Ser Leu Ser Gin Ser 
4185 4190 4195 4200 



14710 



gcc gac ccg egg tta cac gcg gcc ggt ctg aca aat ttc acg tac ctg 
Ala Asp Pro Arg Leu His Ala Ala Gly Leu Thr Asn Phe Thr Tyr Leu 
4205 4210 4215 



14758 



aat age ctg acc ggg aca gta ctg cag age gtc age gcc gat gee ggt 
Asn Ser Leu Thr Gly Thr Val I«eu Gin Ser Val Ser Ala Asp Ala Gly 
4220 4225 4230 



14806 



ft 
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acg teg ctg gaa ctg age gat gcc gcc ggg egg gcg ttt ctg gcc gtc 
Thr Ser Leu Glu Leu Ser Asp Ala Ala Gly Arg Ala Phe Leu Ala Val 
4235 4240 4245 



14854 



acc ggg get ggg acg gaa gac gcg gtc acc cgc acc tgg caa tat gaa 
Thr Gly Ala Gly Thr Glu Asp Ala Val Thr Arg Thr Trp Gin Tyr Glu 
4250 4255 4260 



14902 



gac gat acc ctg ccg ggc cgc ccg ctg age ate acc gag cag gtt acc 
Asp A8p Thr Leu Pro Gly Arg Pro I«eu Ser lie Thr Glu Gin Val Thr 
4265 4270 4275 4280 



14950 



ggt gaa gcc gee caa att acg gaa cgc ttc gtg tac get ggc aat acg 
Gly Glu Ala Ala Gin lie Thr Glu Arg Phe Val Tyr Ala Gly Asn Thr 
4265 4290 4295 



14998 



gat gee ^ag aag att etc aat ctg get ggc cag tgt gtc agt cat tac 
Asp Ala Glu Lys He Leu Asn Leu Ala Gly Gin Cys Val Ser His Tyr 
4300 4305 4310 



15046 



gat ace gcc gga ctg gtg cag acg gac age ate gee ctg age ggc gtg 
Asp Thr Ala Gly Leu Val Gin Thr Asp Ser He Ala Leu Ser Gly Val 
4315 4320 4325 



15094 



ccg etc gee gtc acg egg cag ttg ctg cec gac gcg gcg ggg gcc aac 
Pro Leu Ala Val Thr Arg Gin Leu Leu Pro Asp Ala Ala Gly Ala Asn 
4330 4335 4340 



15142 



tgg atg ggt gag gat gcc. teg gee tgg aat gac ctg ctg gat ggg gag 
Trp Met Gly Glu Asp Ala Ser Ala Trp Asn Asp Leu Leu Asp Gly Glu 
4345 4350 4355 4360 



15190 



acg ttc ttc acc cag acc eac get gat gcg acc ggc gcc gtc ctg age 
Thr Phe Phe Thr Gin Thr His Ala Asp Ala Thr Gly Ala Val Leu Ser 
4365 4370 437S 



15238 



ate acc gat gca aaa ggt aat ctg cag egt gtg gea tat gat gtg get 
He Thr Asp Ala Lys Gly Asn Leu Gin Arg Val Ala Tyr Asp Val Ala 
4380 4385 4390 



15286 



ggg ctg eta teg ggc agt tgg ttg acg ctg aag gac ggc acg gag cag 
Gly Leu Ojcu Ser Qly Ser Trp Leu Thr Leu Lys Asp Gly Thr Olu Gin 

4395 4400 4405 



15334 



gtc ate gtg gcc tec ctg acg tac teg gee gcc ggg aaa aag ttg egt 
Val He Val Ala Ser Leu Thr Tyr Ser Ala Ala Gly Lys Lys Leu Arg 
4410 4415 4420 



15382 



gaa gaa cac ggc aac ggc g'tg gta ace teg tat att tac gag ccg gaa 
Glu Glu His Gly Asn Gly Val Val Thr Ser Tyr He Tyr Glu Pro Glu 
4425 4430 4435 4440 



15430 



aca cag cgc ctg acg ggg att aaa acg gaa egt ccg tct ggg cac gtt 
Thr Gin Arg Leu Thr Gly He Lys Thr Glu Arg Pro Ser Gly His Val 

4445 4450 4455 



15478 



gee gga gca aaa gtg ctg cag gac ctg cgc tat acg tat gac ccg gta 15526 
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Ala Gly Ala Lys Val Leu Gin Asp Leu Arg Tyr Thr Tyr Asp Pro Val 
4460 4465 4470 



ggc aac gta etc age gtc aat aac gat gcg gaa gag acc cgc ttc tgg 
Gly Asn Val Leu Ser Val Asn Asn Asp Ala Glu Glu Thr Arg Phe Trp 
4475 4480 4465 



15574 



cgt aac cag aaa gtg gta ccg gag aat acg tac ate tac gac age ctg 
Arg Asn Gin Lys Val Val Pro Glu Asn Thr Tyr lie Tyr Asp Ser Leu 

4490 4495 4500 



15622 



tac cag ctg gtc age gee aca ggg cgt gag atg gcc aat gee ggc cag 
Tyr Gin Leu Val Ser Ala Thr Gly Arg Glu Met Ala Asn Ala Gly Gin 
4505 4510 4515 4520 



15670 



cag ggc aac gac tta cea tec get aca gcc ccc ctt cct aca gac age 
Gin Gly Asn Asp Leu Pro Ser Ala Thr Ala Pro Leu Pro Thr Asp Ser 
4525 4530 4535 



1571B 



tct gcc tac acc aat tac acg cgc acc tac cgt tat gac cgt ggt ggc 
Ser Ala Tyr Thr Asn Tyr Thr Arg Thr Tyr Arg Tyr Asp Arg Gly Gly 
4540 4545 4550 



15766 



aac ctg acg cag atg cgc cac agt gcc cct gcc acg aac aat aat tat 
Asn Xjeu vThr Gin Met Arg His Ser Ala Pro Ala Thr Asn Asn Asn Tyr 
4555 4560 4565 



15814 



acg aca gac ate acg gtt agt gac cgc age aat agg gcg gta ctg age 
Thr Thr Asp lie Thr Val Ser Asp Arg Ser Asn Arg Ala Val Leu Ser 
4570 4575 4580 



15862 



acg ttg gcg gaa gtg ccg tea gat gtt gat atg ctg ttc agt gca gga 
Thr Leu Ala Glu Val Pro Ser Asp Val Asp Met Leu Phe Ser Ala Gly 
4585 4590 4595 4600 



X5910 



ggt cac cag aag cac ctg cag cc^ ggg caa gca ctg gtg tgg acg cea 
Gly His Oln Lys His Leu Gin Pro Gly Gin Ala Leu Val Trp Thr Pro 
4605 4610 4615 



15958 



cgt gga gaa ctg caa aag gtg aca ccg gtg gtg cgt gat ggg ggg gcg 
Arg Gly Glu Leu Gin Lys Val Thr Pro Val Val Arg Aap Gly Gly Ala 
4620 4625 4630 



16006 



gac gac age gaa age tat egg tat gat gcg ggc agt cag cgt att ate 
Asp Asp Ser Glu Ser Tyr Arg Tyr Asp Ala Gly Ser Gin Arg lie lie 
4635 4640 4645 



16054 



aaa acc ggc acg egg caa act ggc aac aac gtt cag aca cag egg gta 
Lys Thr Gly Thr Arg Gin Thr Gly Asn Asn Val Gin Thr Gin Arg Val 
4650, 4655 4660 



16102 



gtg tac ctg ccg ggg ctg gag tta cgt ate atg gca aat ggc gtg acg 
Val Tyr Leu Pro Gly Leu Glu Leu Arg He Met Ala Asn Gly Val Thr 
4665 4670 4675 4680 



16150 



gaa aaa gaa age ctg cag gtt att acg gtg ggc gag get ggg egg gca 
Glu Lys Glu Ser Leu Gin Val He Thr Val Gly Glu Ala Gly Arg Ala 
4685 4690 4695 



16196 
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caa gtg cgc gta ttg cac tgg gag ate ggc aag ccg gat gac etc gat 
Gin Val Arg Val Leu His Trp Glu lie Gly Lys Pro Asp Aep Leu Asp 
4700 4705 4710 



16246 



gag gac fcg gtg cgt tac agt tac gat aac ctg gtg ggc age age cag 
Glu Asp Ser Val Arg Tyr Ser Tyr Asp Asn Leu Val Gly Ser Ser Gin 
4715 4720 4725 



16294 



Ctg gag ctg gac aga gag ggt tac ctt ate agt gag gag gag ttc tac 
Leu Glu Leu Asp Arg Glu Gly Tyr Leu lie. Ser Glu Glu Glu Phe Tyr 
4730 4735 4740 



16342 



ccg tat ggc gga acg get gtt ctg acg gcg cga agt gag gtt gag get 
Pro Tyr Gly Gly Thr Ala Val Leu Thr Ala Arg Ser Glu Val Glu Ala 
4745 4750 4755 4760 



16390 



gac tac aaa act ate cga tac tea ggc aag gag cgt gac gcg acg ggg 
Asp Tyr Lys Thr lie Arg Tyr Ser Gly Lys Glu Arg Asp Ala Thr Gly 
4765 4770 4775 



16438 



ctg gat tat tac ggt tat egg tat tac cag cca tgg gca ggg cgc tgg 
Leu Asp Tyr Tyr Gly Tyr Arg Tyr Tyr Gin Pro Trp Ala Oly Airg Trp 
4760 4785 4790 



16486 



etc tec acg gac ccg gea ggc acg gtg gac ggg ctg aac ctg ttc cgc 
Leu Ser Thr Asp Pro Ala Gly Thr Val Asp Gly Leu Asn Ijcu Phe Arg 
4795 4800 4805 



16S34 



atg gtg egg aat aat ccc gtc acg ctg ttt gac age aac ggg egg ate 
Met Val Arg Asn Asn Pro Val Thr Leu Phe Asp Ser Asn Gly Arg lie 
4810^ 4815 4820 



16562 



agt act ggt cag gag gcc aga cga tta gtg ggg gaa gca ttt gtt cat 
Ser Thr Oly Gin Glu Ala Arg Arg Leu Val Gly Glu Ala Phe Val His 
4825 4830 4835 4840 



16630 



ccg tta cac atg cct gtt ttt gaa aga att tet gta gag aga aag att 
Pro Leu His Met Pro Val Phe Glu Arg lie Ser Val Glu Arg Lys lie 
4645 4850 4855 



16678 



tea atg age gta agg gaa get ggc att tat act att tea gcg ctg ggt 
Ser Met Ser Val Arg Glu Ala Gly lie Tyr Thr Xle Ser Ala I«eu Oly 
4860 4865 4870 



16726 



gaa ggt gca gca gca aaa ggc cat aat att eta gag aaa acc att aaa 
Glu Gly Ala Ala Ala Lys Gly His Asn He Leu Glu Lys Thr He Lys 
4875 4880 4885 



16774 



ccc ggt tec ctg aag get ate tat ggt gat aaa get gag tea att ctt 
Pro Gly Ser Leu Lys Ala He Tyr Oly Asp Lys Ala Glu Ser Zle I#eu 
4890 4895 4900 



16622 



gga ctg gea aaa cgt age ggt etc gtt ggc cga gta gga cag tgg gat 
Oly Leu Ala Lys Arg Ser Gly Leu Val G^y Arg Val Oly Gin Trp Asp 
4905 4910 4915 4920 



16670 



gca tea ggt gta cgt gga att tat gcg cac aac aga ccg ggt ggt gag 



16918 
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Ala Ser Gly Val Arg Gly lie Tyr Ala His Asn Arg Pro Gly Gly Glu 
4925 4930 4935 



gat ttg gtt tat cct gtc age ctg cag aat act tct gcc aat gaa att 
Asp Leu Val Tyr Pro Val Ser Leu Gin Asn Thr Ser Ala Asn Glu lie 
4940 4945 4950 



16966 



gtt aat gca tgg ata aaa ttt aaa ate ate acg ccc tac acc ggg gat 
Val Asn Ala Trp lie Lys Phe Lys He He Thr Pro Tyr Thr Gly Asp 
4955 4960 4965 



17014 



tat gac atg cac gat att att aaa ttc tct gat ggg aaa ggg cat gtg 
Tyr Asp Met His Asp He He Lys Phe Ser Asp Gly Lys Gly His Val 
4970 ^ 4975 4980 



17062 



cct aca gcg gaa agt agt gag gaa aga gga gta aaa gat eta att aat 
Pro Thr Ala Glu Ser Ser Glu Glu Arg Gly Val Lys Asp Leu He Asn 
4985 4990 4995 5000 



17110 



aaa ggt gtt gcg gag gtc gat cct tec aga ccc ttt gag tat aca gcg 
Lys Gly Val Ala Glu Val Asp Pro Ser Arg Pro Phe Glu Tyr Thr Ala 
5005 5010 5015 



17158 



atg aat gtt att cgc cat gga cca cag gtg aac ttt gtt ccc tat atg 
Met Asn Val He Arg His Gly Pro Gin Val Asn Phe Val Pro Tyr Met 
5020 5025 5030 



17206 



tgg gaa cat gag cac gat aaa gtc gtt aat gat aat ggt tat ctg ggg 
Trp Glu His Olu His Asp Lys Val Val Asn Asp Asn Gly Tyr Leu Gly 
5035 5040 5045 



17254 



gtg gta get age ccg ggg ccg ttc ccg gta gcg atg gta cat cag ggg 
Val Val Ala Ser Pro Gly Pro Phe Pro Val Ala Met Val His Gin Gly 
5050 5055 5060 



17302 



gaa tgg act gtt ttt gac aac agt gaa gaa ctg ttt aat ttc tat aaa 
Glu Trp Thr Val Phe Asp Asn Ser Glu Qlu Leu Phe Asn Phe Tyr Lys 
5065 5070 5075 5080 



17350 



tct aca aat aca cct ctt cct gaa cac tgg tec caa gat ttt atg gac 
Ser Thr Asn Thr Pro Leu Pro Glu His Trp Ser Gin Asp Phe Met Asp 
5085 5090 5095 



17398 



aga ggg aaa gga ata gtc gca act cct egg cat get gaa ctt ctt gat 
Arg Gly Lys Gly He Val Ala Thr Pro Arg His Ala Glu Leu Leu Asp 
5100 5105 5110 



17446 



aaa cga cga gtc atg tac taa tegtaacgat ttcctgcctt acccaaagta 
Lys Arg Arg Val Met Tyr 
5115 



17497 



tacagcccgg tgagacattt tctctgtctc atttgggttg tttttgtctc atctgeatgt 17557 
tatgtcttcc ctcatctaaa gtctaacgag acatttttag eaaaatggca ctttacggtt 17617 
atgttcgcgt ttcaaccgae ggtccggatt ttactetgta aatacagaca ettcgcgcag 17677 
cctgctgcga aattatccgt gcgaaaaaag ccagcggcag cagccgggat ggacgaaatg 17737 
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aactgcagct tctgctggct tttttgcggc caggcaacat gctgatggtt acgtgagttg 17797 
atcggctgcc accaaaaagt ccggagcgtg cggcccagat. cgccgcaata atactgctgt 17857 
atggtatttc catcaccact gtatatcgca cactctgggc cttccagaaa ccccataccg 17917 
cacaccggtg tgatcgctgg aagccccggg cattaccgcc gtctgtactc gaacactatt: 17977 
gtggacttga tggttaggag attgaatcga ccatttttga gatccctaac catagatcgt 18037 
agagttgcac actcccagat ggcgtggctt agcgagcgat tatgctt^aaa aattcatgtt 18097 
ttgctgtgtt tttaatccaa aacctgcttt tcaggcgcac ttatccagct acggggtctg 18157 
aagccatcgt ttttttgccg tacgatgtag cctgtcagag agcattcctg tggcgtgctc 18217 
gcccgctacg gtaccggcgg caaaacgcag ccggcctttg cagaggatgc actggtacgg 18277 
atcggtgccc aggaagcctt tcatcagcac cgegaacccg ggccgtttcg gtttctcccg 18337 
taccgtcatc tccagcgcgt cgtaaacctt cggcagcagc gtgcccgttt gcggttggcc 18397 
agaaaaccat agtaacgcac cattttaaaa tgccgtgcag ggatatggct gacgtaacgc 18457 
tgcagcatct cctcctggct gattttctgg cgtttgtgct gctgcgtacg gtgatcgtaa 18517 
tactgatgca ccacggcccc gccgcggtag tggcgtagct gagaagccgc caccggcggg 18577 
cgcttcaggt accgggccag gtatttcacg ctgcgccagg cgccgcgggt ctttttggca 18637 
aaattcactt fcccaggggcg gcggtattgc gcatgcaggg tcttcgttgc ggatatggcc 18697 
gagacccggc agggcgccag gattgatgcg cagcaggtga acgacggcat tgcgccagat 18757 
ggcttccacc tctttctttt taaagaacag ctgccgccag acgtggtgtt tgacgtcaag 18817 
accgccgcgg gtaacggaga cgtggatatg cggatgttga ttgagctgcc ggccttaggt 18877 
gtggagcgcg caaaaaatgc cggcctcgat gccctgccgg cgtgcccagc ggagcatggc 18937 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acid residues 
1 (B) TYPE: amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: PROTEIN (ORF 1) 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lye lie Ser Ser Arg Gly He Ala Leu He Lys Glu Phe Glu Gly 
15 10 15 

Leu Arg Leu His Ala Tyr Arg Cys Ala Ala Asp Val Trp Thr Val Gly 
20 25 30 

Tyr Gly Hie Thr Ala Gly val Thr Lys Gly Asp He He Thr Val Asp 
35 40 45 

Glu Ala Gin Thr Met Leu Thr Asn Asp He Thr Val Phe Glu Arg Ala 
50 55 60 

Val Ser Gin Ala Val Ala Val Pro Leu Asn Gin Ser Gin Tyr Asp Ala 
65 'JO 75 80 

Leu Val Ser Leu Val Phe Asn He Gly Gin Gly Asn Phe Lys Arg Ser 
85 90 55 

Thr LeuvLeu Lys Lys Leu Asn' Lys Gin Asp Tyr Val Gly Ala Gly Asn 
100 105 110 

Glu Phe Leu Arg Trp Thr Arg Ala Asn Gly Lys Val Leu Pro Gly Leu 
115 120 125 

He Arg Arg Arg Glu Ala Glu Arg Val Leu Phe Glu Lys Leu Gly 
130 135 140 

Ala 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: PROTEIN (ORF 2) 

(13^ SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

X,ys lie Hie Tyr Cln Val Ala Val Val Le« Thr Oly Val Met val 
^ Oly X-u ser His Trp Arg Tyr Tl.r Val Oly Tyr Hi. Ala Ala Asp 



35 " 



Thr Gin Trp Gin Gin Arg Gin Ala Glu Gin Gl« Arg Ala Asp Ala Leu 

SO 5^ 
^a Leu Leu Ala Ala Glu Thr Arg Glu Arg Lys Trp Glu Gin Gin Arg 

„ y ala Tie His Ala Glu Glu Glu Leu Ala 
Gin Thr ASP Met Asn Lys Val Ala lie Hxs Aia 

85 

;aa Ala Arg Asp Ala Ala Ala Asp Ala Gin Arg Thr Gly Gin Arg Leu 
100 

om H^s Thr val Thr Thr Leu Gin Arg Gin Leu Ala Ser Arg Glu Thr 

115 "° 

^ Leu ser Ala Ala Thr Ala Xle Gly Thr Asp Asp Leu Gly Gly 
130 

Gin pro Gly Val Leu Phe Ala Glu Leu Phe Arg Arg Ala Asp Gin Arg 

145 «0 

.Xa Gly Glu Leu Ala Ala Tyr Ala Asp Arg Thr Arg val Lys Trp Gin 

165 

;aa cys Gly Arg Ala Tyr Gin Ala Ala Thr His Glu Ala Glu Lys 



180 "5 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2376 amino acid residues 

(B) TYPE: amino acid 
CD) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: PROTEIN (SepA) 

<ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Arg Gin Asp He Met Tyr Asn He Asp Asp He Leu Glu Lys Val 



10 



Asn Ala Pro Arg Ala Arg Leu Ser Glu Glu Asn Asp Thr Ala Val Thr 



20 



25 



Leu 



Thr ASP Leu Phe Ser Arg ser Phe Pro Glu Val Lys Lys He Thr 
35 40 45 

Gly Asp ser Leu Ser Trp Gly Glu Val Cys Tyr Leu Tyr Ser Gin Ala 

50 S5 60 

Gin His Glu Gin Lys Glu Asn Arg Leu Thr Glu Ser Arg He Leu Ala 
65 ''^ 

Arg Ala Asn Pro Leu Leu Val Asn Ala Val Arg Leu Gly He Arg Gin 
85 90 35 

Asp Trp Phe Gly Ser Arg Ala Asp 
105 



Arg 



Ala Ala Gly Ser Arg Ser Tyr Asp 
100 

Phe Ala Arg Pro Gly Ser Val Ala Ser Met Phe Ser Pro Ala Ala 
115- 120 125 

Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp Leu His Pro Asp Thr 
130 135 1*0 

ser Leu Phe Arg Leu Asp He Arg Arg Pro Asp Leu Ala Ala Leu Ala 
145 150 155 

Leu ser Gin Asn Asn Met Asp Asp Glu Leu Ser Thr Leu Ser Leu Ser 
165 I'^O 1^^ 

Glu Leu Leu Tyr Arg Gly He Gly Ala Ala Glu Gly Leu Asp Asp 



Asn 

IBO 



185 190 
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Asp Ser val Arg Glu Leu Leu Ala Gly Tyr Arg Leu Thr Gly Leu Thr 
yXSS 200 205 

pro Tyr His Trp Ala Tyr Glu Ala Ala Arg Gin Ala He Leu Val Gin 
210 215 220 

Asp Pro Thr Leu Met Gly Phe Ser Arg Asn Pro Asp Val Ala Gin Leu 
225 230 235 240 

Met Asp Pro Ala Ser Met Leu Ala He Glu Ala Asp He Ser Pro Glu 
245 250 255 

Leu Tyr Gin He Leu Ala Glu Glu He Thr Thr Asp Ser Tyr Glu Ala 
260 265 270 

Leu Trp Ser Lys Asn Phe Gly Asp Met Pro Pro Ser Ser Leu Leu Ser 
275 280 285 

Tyr Asp Ala Leu Ala Thr Phe Tyr Asp Leu Asp Tyr Asp Glu Leu Thr 
290 295 300 

ser Leu Leu Ser Leu Arg Leu Asp Phe Ser Asn Pro Asn Asn Glu Tyr 
305 310 315 320 

Tyr He Asn Ser Gin Leu Ser Val Val Thr lieu Asn Glu Ser Thr Gly 
325 330 335 

Leu He Thr He His His Tyr Leu Arg Thr Leu Gly Gly Asp Ser Oln 
340 345 350 

Gin He Asn Pro Glu Leu He Pro Tyr Gly Asp Gly Thr Tyr Leu Tyr 
355 360 365 

Asn Phe Ser Val Val Ser Thr He Ser Glu Asp Ser Phe Lys Leu Gly 
370 375 380 

Ser Leu Gly Ser Asn Ser Ser Asn Leu Tyr Ser Gly Asp Tyr Gin Leu 
385 390 395 400 

Gin Lys Gly Val Arg Tyr Ser He Pro Val Glu He Asp Glu Gly Lys 
405 410 415 

Leu Asn Asp Gly He Thr He Gly Leu Ser Arg Lys Gly Gly Gly Tyr 
420 425 430 

Tyr ser Thr Val Asn Phe Thr Leu He Glu Tyr Asp Pro Ala He Phe 
435 440 445 

He Leu Lys Leu Asn Lys Val He Arg Leu Tyr Lys Ala Thr Gly Met 
450 455 460 

Thr Thr Ala Glu He Tyr Gin He Thr Asn He Leu Asn Asn Gly Leu 
465 * 470 475 460 

Thr He Asp His Ala Val Leu Ser Lys He Phe Leu Val Arg Tyr Leu 
485 490 495 



Met 



Arg His Tyr Gin Leu Asp Val Ala Arg Ser Leu He Leu Cys Asn 
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500 505 510 

Oly Thr He Ser Asp Gin Ala Phe Ser Gly Glu Thr Gly Leu Phe Thr 
515 520 525 

Thr Leu Phe Asn Thr Pro Pro Leu Asn Gly Gin Leu Phe Ser Ala Asp 
530 535 540 

ASP Thr Pro Leu Asp Leu Arg Ser Glu Ala Pro Glu Asp Ala Phe Arg 
545 ^ 550 555 560 

Leu ser Val Leu Lys Arg Ala Phe Asn He Ser Ala Ser Gly Leu Ser 
565 570 575 

Thr Leu Trp Gin Leu Ala Ser Gly Asp Ser Ser Ala Gly Phe Ser Cye 
580 585 590 

ser Ala Asp Asn He Ala Ala Leu Tyr Arg Val Lys Leu Leu Ala Asp 
595 600 605 

He His Asp Leu Ser Ala Gly Glu Leu Ser Met Leu Leu Ser Val Ser 
610 615 620 

Pro Phe Ser Gly Val Ala Ala Gly Ser Leu Ser Asp Asn Glu Leu Thr 
625 630 635 640 

Gin Phe Leu Tyr Oln Thr Thr Thr Trp Leu Thr Glu Qln Oly Trp Thr 
645 650 655 

val ser Asp Val Phe Leu Met Leu Thr Thr Gin Tyr Gly Thr Leu Leu 
660 665 670 

Thr pro Asp He Glu Asn Leu Leu Ala Ser Leu Arg Asn Gly Leu Ser 
675 680 685 

Gly Arg Olu Leu Phe Pro Glu Thr Leu Pro Gly Asp Gly Ala Pro Phe 
690 695 700 

He Ala Ala Ala Met Gin Leu Asp Ala Thr Asp Thr Ala Lys Ala Met 
705 710 715 720 

Leu Thr Trp Ala Asp Gin Leu Lys Pro Glu Gly Leu Thr Leu Thr Glu 
725 730 735 

Phe He Leu Leu val Met Asn Ala Ala Pro Asn Asp Glu Gin Ala Gly 
740 745 750 



Gin Met Ala Gly Phe Cys 



Gin Ala Leu Trp Gin Leu Ala Leu He He 
755 760 765 

Arg ser Thr Gly Leu Ser Thr Arg Glu Leu Thr Leu Leu Val Ser Gin 

780 



770 775 



Pro Gly Arg Phe Arg Thr Gly Trp His His Leu Pro His Asp Leu Pro 
785 790 795 800 

Ala Leu Arg Asp He Thr Arg Phe His Ala Val Val Asn Arg Ser Gly 
605 BIO 815 
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Ser His Ala Gly Glu Val Leu Thr Ala Leu Glu Thr Gly Glu Leu Ser 
820 825 830 

Ser Ala Leu Leu Ala Arg Ala Leu Ser Gin Asn Glu Gin Asp Val Thr 
835 840 845 

Gly Ala Leu Ala Gin Val Arg Gly Ala Gly Glu Gin Asp Asn Ser Val 
850 855 860 

Phe Thr Ser Trp Glu Glu Val Asp Gin Ala Glu Gin Trp Leu Asp Met 
865 870 875 880 

Ser Glu Thr Leu Ser He Thr Pro Ser Gly Leu Ala Ser Leu He Ala 
885 890 895 

Leu Lys Tyr He Asn Val Ser Asp Asp Ser Ala Pro Leu Tyr Ser Gin 
^ 900 905 910 

Trp Gin Val Val Ser Gly I-eu Leu Gin Ala Gly Leu Lys Ser Ser Gin 
915 920 925 

Ser ser Ala Leu His Asp Tyr Leu Glu Glu Gly Thr Ser Ser Ala Leu 
930 935 940 

Cys Ala Tyr Tyr Leu Arg Asn Leu Ala Pro Asn Met Val Ser Gly Arg 
945 950 955 960 

ASP Asp Leu Phe Gly Tyr Leu Leu Leu Asp Asn Gin Val Ser Ala Lys 
965 970 975 

Val Lys Thr Thr Arg He Ala Glu Ala He Ala Gly He Arg Leu Tyr 
980 985 990 

He Asn Arg Ala Leu Asn Gly He Glu Leu Ser Ala Met Ala Glu Val 
995 1000 1005 

Arg Gly Arg Gin Phe Phe Thr Asp Trp Asp Thr Phe Asn Lys Arg Tyr 
1010 1015 1020 

Ser Thr Trp Ala Gly Val Ser Glu Leu Val Tyr Tyr Pro Glu Asn Tyr 
025 1030 1035 1040 

Leu Asp Pro Thr Val Arg He Gly Gin Thr Gly Met Met Asp Thr Leu 
1045 1050 1055 

Leu Gin Ser Val Ser Gin Ser Ser He Asn Arg Asp Thr Val Glu Asp 
1060 1065 1070 

Ma Phe Lys Thr Tyr Leu Thr Thr Phe Glu Gin He Ala Asn Leu Asn 
1075 1080 1085 

Thr val Ser Gly Tyr His Asp Asn Ala Ser Met Thr Gin Oly Thr Thr 
1090 1095 1100 

Trp Tyr Val Gly Arg Ser He Thr Asp Gin Thr Asn Trp Tyr Trp Arg 
105 1110 1115 1120 
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Ser Ala Asn His Ser Lys lie Gin Asp Ser Met Met Pro Ala Asn Ala 
1125 1130 1135 

Trp Thr Gly Trp Thr Lys lie Asn Cys Gly Met Asn Pro Trp Ser Asp 
1140 11*5 1150 

Leu Val Cys Ser Val Phe Phe Asn Ser Arg Leu Tyr Val Val Trp Val 
1155 1160 1165 

Glu Glu Asn Gin Ser Ala Asp Thr Glu Ala Glu Ser Thr Thr Thr Thr 
1170 1175 1180 

Gin Gin Ser Tyr Thr Leu Lys Leu Ser Phe Arg Arg Tyr Asp Gly Thr 
155 1190 1195 1200 

Trp Ser Ser Pro Val Ser Phe Asp He Thr Gly Asn He Ala Phe Pro 
1205 1210 1215 

Glu Thr Gin Gly Met His Val Thr Cys Asn Pro Leu Thr Glu Gin Leu 
1220 1225 1230 

Tyr Cys Ala Phe Tyr Ser Val Thr Ser Lys Pro Asp Phe Asp Asn Ala 
1235 1240 1245 

Gin Leu lie Ser Val Asp Asn Asp Met Thr Leu Asn Val He Ser Asp 
1250 1255 1260 

He Gly He Phe Lys Ser Val Ser His Glu Phe Asn Thr Ser Thr Glu 
265 1270 1275 1280 

Lys Phe He Asn Asn Val Phe Ser Asp Pro Ser Ala Asn Tyr Phe Val 
1285 1290 1295 

Ser Ala Thr Ser Leu He Asp Asp Val He His Ser Asp Phe Ser Leu 
1300 1305 1310 

Leu Asn Ser Lys Thr Thr Ser Thr Val Phe Thr Asn Glu Asp Ser Ser 
1315 1320 1325 

Leu Leu Thr Pro Glu Leu His He Thr Ala Asn Val Ser Cys Phe Val 
1330 1335 1340 

Ser Thr Ala Gly He Ala Thr Gin Ser Thr He Glu Lyo Phe Val Gin 
345 1350 1355 1360 

Ala Gly He Glu Phe Glu Glu He Asn Phe Tyr Ala Gly Gin Ala Ala 
1365 1370 1375 

Gly Gly Phe Asp Gly Phe Val Gly Val Asp Val Ser Asn Ser Lys Val 
1380 1385 1390 

Tyr Gin Val Gly Lys Glu Ala Val Gly Val Thr Val Lys Ser Tyr Ser 
1395 1400 1405 

Val Thr Gly Val Ser Gly Ser Val Glu Leu Phe He Asp Ser Ser Asn 
1410 1415 1420 

Lys Tyr Phe Ser Gly He Leu Ser Asp Lys Met He Thr Ala Leu He 
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425 1430 1435 1440 

Ser Gly Ser Thr Ser Lys Val Asn Tyr Val Ser Ser He Gly Ser Gin 
1445 1450 1455 

ASP Phe Trp Ser Val Lys Ser Leu Met Pro Ala Leu Gin He Tyr Glu 
1460 1465 1470 

Leu He' Asp Asp He He Leu Thr Ser Gly Val Asn Gly Thr Glu He 
1475 1480 1485 

Lys Ser Trp Pro Ser Ala Glu Trp Tyr Asn Asp Lys Leu Ser Leu Gin 
14 90 1495 1500 

Ser Gly Asn Asn Leu Phe Asn Thr Lys Ser Leu Ser Phe Thr Vai Asn 
505 1510 1515 1520 

Thr Ser Asp He Val Glu Asp Glu Phe Asp Val Thr Phe Thr Phe Thr 
1525 1530 1535 

Ala Val Asp Gin Asn Asn Val Val Leu Ala Ala Arg Thr Ala He Leu 
1540 1545 1550 

Thr Val He Arg Asn He Asn Asn Asp Thr Ser Val He Ala Leu Arg 
1555 1560 1565 

Lys Asn Thr Arg Gly Ala Gin Tyr He Arg Phe Thr Ala Gly Asn Asp 
1570 1575 1580 

Val Ala Leu He Arg Leu Asn Thr Leu Phe Ala Arg Gin Leu Val Asp 
585 1590 1595 1600 

Arg Ala Asn Thr Gly He Asp Thr He Leu Ser Met Glu Thr Gin Arg 
1605 1610 1615 

Leu Thr Glu Pro Ala Leu Glu Glu Gly Ser Asp Val Phe Met Asp Phe 
1620 1625 1630 

ser Gly Ala Asn Ala Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Thr Pro 
1635 1640 1645 

Met Met Val Phe Gin Arg Leu Leu Gin Glu Gin His Phe Pro Glu Ala 
1650 1655 1660 

Thr Arg Trp Leu Gin Tyr Val Trp Asn Pro Ala Gly His Val Val Asn 
665 1670 1675 1680 

Gly Val Leu Gin Asn Tyr Thr Trp Asn Val Arg Pro Leu Glu Glu Asp 
1685 1690 1695 

Thr Gly Trp Asn Asp Ser Pro Leu Asp Ser He Asp Pro Asp Ala He 
1700 1705 1710 

Ala Oln Tyr Asp Pro Met His Tyr Lys Val Ala Thr Phe Met Ser Tyr 
1715 1720 1725 

Leu Asp Leu Leu He Ala Arg Gly Asp Ala Ala Tyr Arg Leu Leu Glu 
1730 1735 1740 



1^ 
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Arg Asp Thr Leu Asn Glu Ala Arg Met Trp Tyr Val Gin Ala Leu Asn 
745 1750 1755 1760 

Leu Leu Gly Asp Glu Pro Tyr lie Ser Phe Asp Ala Asp Trp Ser Ala 
1765 1770 1775 

Leu Thr Leu Gly Asp Ala Ala Ser Glu Val Thr Arg Arg Asp Tyr Gin 
1780 1785 1790 

Glu Ala Leu Leu Ala Val Arg Arg Leu Val Pro Ala Pro Glu Thr Arg 
1795 1800 1805 

Thr Ala Asn Ser Leu Thr Ala Leu Phe Leu Pro Gin Gin Asn Glu Val 
1810 1815 1820 

Leu Lys Gly Tyr Trp Gin Thr Leu Ala Gin Arg Leu Hie Asn Leu Arg 
825 1830 1835 1840 

Hie Asn Leu Ser lie Asp Gly Gin Pro Leu Ser Leu Ser Val Tyr Ala 
1845 1850 1855 

Thr Pro Ser Glu Pro Ser Ala Leu Gin Ser Ala Val Val Asn Ser Ala 
1860 1865 1870 

Gin Gly Ala Ala Ala Leu Pro Ala Ala Val Met Pro Leu Tyr Ser Phe 
1875 1880 1885 

Pro Val Met Leu Glu Asn Ala Arg Gly Met Val Ser Leu Leu Thr Gly 
1890 1895 1900 

Phe Gly Asn Thr Leu Leu Gly lie Thr Glu Arg Gin Asp Ala Glu Ala 
905 1910 1915 1920 

Leu Ala Lye Leu Leu Gin Thr Gin Gly Ser Glu Leu lie Arg Gin Gly 
1925 1930 1935 

Leu Arg Gin Gin Asp Asn Val Leu Glu Glu He Asp Ala Asp He Ala 
1940 1945 1950 

Ala Leu V Glu Glu Ser Arg Arg Gly Ala Gin Met Arg Phe Glu Arg Tyr 
1955 I960 1965 

Lys Val Leu Tyr Glu Ala Asp Val Asn Thr Gly Glu Lys Gin Ala Met 
1970 1975 i960 

Asp Leu Tyr Leu Ser Ser Ser Val Leu Ser Ala Ser Thr Ala Ala Leu 
985 1990 1995 2000 

Phe Leu Ala Glu Ala Ala Ala Asp Met Leu Pro Asn He Tyr Gly Leu 
2005 2010 2015. 

Ala Val Gly Gly Ser Arg Tyr Gly Ala Leu Phe Lys Ala Thr Ala He 
2020 2025 2030 

Gly He Gin Val Ser Ser Asp Ala Thr Arg He Ser Ala Asp Lys He 
2035 2040 2045 
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Ser Gin Ser Glu Val Tyr Arg Arg Arg Arg Glu Glu Trp Glu lie Gin 
2050 2055 2060 

Arq ASP Ser Ala Gin Ser Asp Val Ala Gin He Asp Ala Gin Leu Ala 
065 2070 2075 2080 

Ala Met Ala Val Arg Arg Glu Gly Ala Glu Leu Gin Lye Thr Tyr Leu 
2085 2090 2095 

Glu Thr Gin Gin Thr Gin Ala Gin Ala Gin Leu Ala Phe Leu Gin Ser 
2100 2105 2110 

Lys Phe Asn Asn Thr Ala Leu Tyr Ser Trp Leu Arg Gly Arg Leu Ser 
2115 2120 2X25 

Ala He Tyr Tyr Gin Phe Tyr Asp Leu Ala Val Ser Arg Cys Leu Met 
2130 2135 2140 

Ala Gin Gin Ala Trp Gin Trp Asp Lys Phe Glu Thr Arg Ser Phe He 
145 2150 2155 2160 

Gin Pro Gly Ala Trp Met Gly Ala Asn Ala Gly Leu Leu Ala Gly Glu 
2165 2170 2175 

Thr Leu Met Leu Asn Leu Ala Gin Met Glu Gin Ala Trp Leu Thr Gly 
2180 2185 2190 

Asp Glu Arg Ala He Glu Val Thr Arg Thr Val Cys Leu Ser Glu Val 
2195 2200 2205 

Tyr Thr Ser Leu Ala Glu Asp Ala Ala Phe Ser Leu Ala Asp Lys Val 
2210 2215 2220 

val Glu Leu Val Ser Asn Gly Ser Gly Ser Ala Gly Thr Lys Ser Asn 
225 2230 2235 2240 

Gly Leu Gin Met Asp Gin Gin Gin Leu Glu Ala Thr Leu Lys Leu Ala 
2245 2250 2255 

ASP Leu Gly He Gly Asn Asp Tyr Pro Val Ser Leu Gly Thr Met Arg 
2260 2265 2270 

Arg He Lys Gin He Ser Val Thr Leu Pro Ala Leu Val Gly Pro Tyr 
2275 2280 2285 

Gin Asp Val Arg Ala Val Leu Ser Tyr Gly Gly Ser Met Val Met Pro 
2290 2295 2300 

Arg GIV Cys Ser Ala Leu Ala Val Ser His Gly Met Asn Asp Ser Gly 
305 2310 2315 2320 

Gin Phe Gin Leu Asp Phe Asn Asp Pro Arg Tyr Leu Pro Phe Glu Gly 
2325 2330 2335 

Leu Pro Val Asp Asp Thr Gly Thr Leu Thr Leu Ser Phe Pro Asp Ala 
2340 2345 "50 

Asp Gly Lys Gin Gin Ala Met Leu Leu Ser Leu Ser Asp He He Leu 
2355 2360 2365 

His He Arg Tyr Thr He He Ser 
2370 2375 
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<2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1429 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: PROTEIN (SepB) 

(ix) SEQtJENCE DESCRIPTION: SEQ ID NO: 5: 



Met Gin Asn His Gin Asp Met Ala 
1 5 

Gly Gly Gly Ala Val Thr Gly Leu 
20 

Pro As^ Gly Ala Ala Thr Leu Ser 
35 40 

Arg Gly Tyr Ala Pro Thr Gly Ala 
50 55 



lie Thr Ala Pro Thr Leu Pro Ser 
10 15 

Lys Gly Asp lie Ala Ala Ala Gly 
25 30 

lie Pro Leu Pro Val Ser Pro Gly 
45 

Leu Asn Tyr His Ser Arg Ser Gly 
60 
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Asn Gly Pro Phe Gly He Gly Trp Gly He Gly Gly Ala Ala Val Gin 
65 70 75 80 

Arg Arg Thr Arg Asn Gly Ala Pro Thr Tyr Asp Asp Thr Asp Glu Phe 
85 90 95 

Thr Gly Pro Asp Gly Glu Val Leu Val Pro Ala Leu Thr Ala Ala Gly 
100 105 110 

Thr Gin Glu Ala Arg Gin Ala Thr Ser Leu Leu Gly He Asn Pro Gly 
115 120 125 

Gly ser *Phe Asn Val Gin Val Tyr Arg Ser Arg Thr Glu Gly Ser Leu 
130 135 140 

ser Arg Leu Glu Arg Trp Leu Pro Ala Asp Glu Thr Glu Thr Glu Phe 
145 150 155 160 

Trp Val Leu Tyr Thr Pro Asp Oly Gin Val Ala Leu Leu Gly Arg Asn 
165 1-70 175 

Ala Gin Ala Arg lie Ser Asn Pro Thr Ala Pro Thr Gin Thr Ala Val 
180 185 190 

Trp Leu Met Glu Ser Ser Val Ser Leu Thr Gly Glu Gin Met Tyr Tyr 
195 . 200 205 

Gin Tyr Arg Ala Glu Asp Asp Asp Gly Cys Asp Glu Ala Glu Arg Asp 
210 215 220 

Ala His Pro Gin Ala Oly Ala Gin Arg Tyr Pro Val Ala Val Trp Tyr 
225 230 235 240 

Gly Asn Arg Gin Ala Ala Arg Thr Leu Pro Ala Leu Val Ser Thr Pro 
245 250 255 

ser Met Asp Ser Trp Leu Phe He Leu Val Phe Asp Tyr Gly Glu Arg 
1 260 265 270 

ser Ser Val Leu Ser Olu Ala Pro Ala Trp Oln Thr Pro Gly Ser Gly 
275 280 285 

Glu Trp Leu Cys Arg Gin Asp Cys Phe Ser Gly Tyr Glu Phe Gly Phe 
290 295 300 

Asn Leu Arg Thr Arg Arg Leu Cys Arg Gin Val Leu Met Phe His Tyr 
305 310 315 320 

Leu Gly Val Leu Ala Gly Ser Ser Gly Ala Asn Asp Ala Pro Ala Leu 
325 330 335 

He Ser Arg Leu Leu Leu Asp Tyr Arg Glu Ser Pro Ser Leu Ser Leu 
340 345 350 

Leu Glu Asn val His Gin Val Ala Tyr Glu Ser Asp Gly Thr Ser Cys 
355 360 365 
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• Ala Leu pro Ala Leu Ala Leu Gly Trp Glxi Thr Phe Thr Pro Pro Thr 

370 375 380 

Leu Ser Ala Trp Gin Thr Arg Asp Asp Met Gly Lys Leu Ser Leu Leu 
385 390 395 400 

Gin pro Tyr Gin Leu Val Asp Leu Asn Gly Glu Gly Val Val Gly He 
405 410 415 

Leu Tyr Gin Asp Ser Gly Ala Trp Trp Tyr Arg Glu Pro Val Arg Gin 
420 425 430 

Ser Gly Asp Asp Pro Asp Ala Val Thr Trp Gly Ala Ala Ala Ala Leu 
435 440 445 

Pro Thr Met Pro Ala Leu His Asn Ser Gly He Leu Ala Asp Leu Asn 
450 455 460 

Gly Asp Gly Arg Leu Glu Trp Val Val Thr Ala Pro Gly Val Ala Gly 
465 470 475 480 

Met Tyr Ksp Arg Thr Pro Gly Arg Asp Trp Leu His Phe Thr Pro Leu 
485 490 495 

ser Ala Leu Pro Val Glu Tyr Ala His Pro Lys Ala Val Leu Ala Asp 
500 505 510 

He Leu Gly Ala Gly Leu Thr Asp Met Val Leu He Gly Pro Arg Ser 
515 520 525 

Val Arg Leu Tyr Ser Gly Lys Asn Asp Gly Trp Asn Lys Gly Glu Thr 
530 535 540 

Val Gin Gin Thr Glu Arg Leu Thr Leu Pro Val Pro Gly Val Asp Pro 
545 550 555 560 

Arg Thr Leu Val Ala Phe Ser Asp Met Ala Gly Ser Gly Gin Gin His 
565 570 575 

Leu Thr Glu Val Arg Ala Aen Gly Val Arg Tyr Trp Pro Asn Leu Gly 
580 585 590 

His Gly Arg Phe Gly Gin Pro Val Asn He Pro Gly Phe Ser Oln Ser 
595 600 €05 

Val Thr Thr Phe Asn Pro Asp Gin He Leu Leu Ala Asp Thr Asp Gly 
610^ 615 620 

ser Gly Thr Thr Asp Leu He Tyr Ala Met Ser Asp Arg Leu Val He 
625 630 635 640 

Tyr Phe Asn Gin Ser Gly Asn Tyr Phe Ala Glu Pro His Thr Leu Leu 
645 650 655 

Leu Pro Lys Gly Val Arg Tyr Asp Arg Thr Cys Ser Leu Gin Val Ala 
660 665 670 

Asp He Gin Gly Leu Gly Val Pro Ser Leu Leu Leu Thr Val Pro His 
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675 



680 



val Ala Pro His His Trp Val Cys His Leu Ser Ala Asp Lys Pro Trp 
690 ^ 695 700 

Leu Leu Asn Gly Met Asn Asn Asn Met Gly Ala Arg His Ala Leu His 



705 



710 715 720 



TVr Arg Ser Ser Val Gin Phe Trp Leu Asp Glu Lys Ala Glu Ala Leu 
725 730 735 

Ala Ala Gly Ser Ser Pro Ala Cys Tyr Leu Pro Phe Thr Leu His Thr 
740 745 750 

Leu Trp Arg Ser Val Val Oln Asp Glu lie Thr Gly Asn Arg Leu Val 
755 760 765 

ser Asp Val Leu Tyr Arg His Gly Val Trp Asp Gly Oln Glu Arg Glu 
770 775 780 

Phe Arg Gly Phe Gly Phe Val Glu lie Arg Asp Thr Asp Thr Leu Ala 
785 790 



795 800 



Ser Gin Gly Thr Ala Thr Glu Leu Ser Met Pro Ser Val Ser Arg Asn 
805 810 815 

TrD Tyr Ala Thr Gly Val Pro Ala Val Asp Glu Arg Leu Pro Glu Thr 
820 825 830 

Tyr Trp GlA Asn Asp Ala Ala Ala Phe Ala Asp Phe Ala Thr Arg Phe 
835 840 845 

Thr Val Gly Ser Gly Glu Asp Glu Gin Thr Tyr Thr Pro Asp Asp Ser 
850 855 860 

LVB Thr Phe Trp Leu Oln Arg Ala Leu Lys Gly He Leu Leu Arg Ser 
865 870 875 880 

Glu Leu Tyr Gly Ala Asp Gly Ser Ser Oln Ala Asp He Pro Tyr Ser 
885 890 895 

Val Thr Glu Ser Arg Pro Gin Val Arg Leu Val Glu Ala Asn Gly Asp 
900 905 910 

Tyr Pro Val Val Trp Pro Met Gly Ala Glu Ser Arg Thr Ser Val Tyr 
915 920 925 

Glu Arg Tyr His Asn Asp Pro Gin Cys Gin Gin Gin Ala Val Leu Leu 
930 935 940 

Glu Tyr Gly Phe Pro Leu Arg Gin Val Ser Val Asn Tyr Pro 



Ser Asp 
945 950 



955 960 



Arg Arg Pro Pro Ser Ala Asp Asn Pro Tyr Pro Ala Ser Leu Pro Ala 
\ 965 970 975 

Thr Leu Phe Ala Asn Ser Tyr Asp Glu Gin Gin Gin He Leu Arg Leu 
980 985 990 
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Gly Leu Gin Gin Ser Ser Ala His His Leu Val Ser Leu Ser Glu Gly 
995 1000 1005 

His Trp Leu Leu Gly Leu Ala Glu Ala Ser Arg Asp Asp val Phe Thr 
1010 1015 1020 

Tvr Ser Ala Asp Asn Val Pro Glu Gly Gly Leu Thr Leu Glu His Leu 
025 1030 1035 1040 

ueu Ala Pro Glu Ser Leu Val Ser Asp Ser Gin Val Gly Thr Leu Ala 
1045 1050 1055 

Gly Gin Gin Gin Val Trp Tyr Leu Asp Ser Gin Asp Val Ala Thr Val 
1060 1065 1070 

Ala Ala Pro Pro Leu Pro Pro Lys Val Ala Phe lie Glu Thr Ala Val 
1075 1080 1085 

Leu Asp Glu Gly Met Val Ser Ser Leu Ala Ala Tyr He Val Asp Glu 
1090 1095 1100 

Hie Leu Glu Gin Ala Gly Tyr Arg Gin Ser Gly Tyr Deu Phe Pro Arg 
105 1110 1115 1120 

Gly Arg Glu Ala Glu Gin Ala Leu Trp Thr Gin Cys Gin Gly Tyr Val 
1125 1130 1135 

Thr Tyr Ala Gly Ala Glu His Phe Trp Leu Pro Leu Ser Phe Arg Asp 
1140 11*5 1150 

Ser Met Leu Thr Gly Pro Val Thr Val Thr Arg Asp Ala Tyr Asp Cys 
1155 1160 1165 

Val He Thr Gin Trp Gin Asp Ala Ala Gly He Val Thr Thr Ala Asp 
1170 1175 . 1180 

Tyr A^ Trp Arg Phe Leu Thr Pro Val Arg val Thr Asp Pro Asn Asp 
185 1150 1195 1200 

Asn Leu Gin Ser Val Thr Leu Asp Ala Leu Gly Arg Val Thr Thr Leu 
1205 1210 1215 

Arg Phe Trp Gly Thr Glu Asn Gly He Ala Thr Gly Tyr Ser Asp Ala 
]L220 1225 1230 

Thr Leu Ser Val Pro Asp Gly Ala Ala Ala Ala Leu Ala Leu Thr Ala 
1235 1240 1245 

pro Leu Pro Val Ala Gin Cys Leu Val Tyr Val Thr Asp Ser Trp Gly 
1250 1255 1260 

ASP Asp Asp Asn Glu Lys Met Pro Pro His Val Val Val Leu Ala Thr 
265 1270 1275 1280 

ASP Arg Tyr Asp Ser Asp Thr Gly Gin Gin Val Arg Gin Gin Val Thr 
1285 1290 1295 
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Phe Ser Asp Gly Phe Gly Arg Glu Leu Gin Ser Ala Thr Arg Gin Ala 
1300 1305 1310 

Glu Gly Asn Ala Trp Gin Arg Gly Arg Asp Gly Lys Leu Val Thr Ala 
1315 1320 1325 

ser Asp Gly Leu Pro Val Thr Val Ala Thr Asn Phe Arg Trp Ala Val 
1330 1335 1340 

Thr Gly Arg Ala Glu Tyr Asp Asn Lys Gly Leu Pro Val Arg Val Tyr 
345 1350 1355 1360 

Gin Pro Tyr Phe Leu Asp Ser Trp Gin Tyr Val Ser Asp Asp Ser Ala 
1365 1370 1375 

Arg Gin Asp Leu Tyr Ala Asp Thr His Phe Tyr Asp Pro Thr Ala Arg 
1380 1385 1390 

Glu Trp Gin Val He Thr Ala Lys Gly Glu Arg Arg Gin Val Leu Tyr 
1395 1400 1405 

Thr Pro Trp Phe Val Val Ser Glu Asp Glu Asn Asp Thr Val Gly Leu 
1410 1415 1420 



Asn Asp Ala Ser 
425 
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(2) INFORMATION FOR SEQ ID NO: 6: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 973 amino acid residues 

(B) TYPE: amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: PROTEIN (SepC) 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Ser Thr Ser Leu Phe Ser Ser Thr Pro Ser Val Ala Val Leu Asp 
15 10 15 

Asn Arg Gly Leu Leu Val Arg Glu Leu Gin Tyr Tyr Arg His Pro Asp 
20 25 30 

Thr Pro Glu Glu Thr Asp Glu Arg He Thr Cys His Gin His Asp Glu 
35 40 45 

Arg Oly Ser Leu Ser Gin Ser Ala Asp Pro Arg Leu His Ala Ala Gly 
50 55 60 

Leu Thr Asn Phe Thr Tyr Leu Asn Ser Leu Thr Qly Thr Val Leu Gin 
65 70 75 80 

ser val Ser Ala Asp Ala Gly Thr Ser Leu Glu Leu Ser Asp Ala Ala 
85 90 95 

Gly Arg Ala Phe Leu Ala Val Thr Gly Ala Gly Thr Glu Asp Ala Val 
100 105 110 

Thr Arg Thr Trp Gin Tyr Glu Asp Asp Thr Leu Pro Gly Arg Pro Leu 
115 120 125 

Ser He Thr Glu Gin Val Thr Gly Glu Ala Ala Gin He Thr Glu Arg 
130 135 140 

Phe val Tyr Ala Gly Asn Thr Asp Ala Glu Lys He Leu Asn Leu Ala 
145 150 155 160 

Gly Gin Cys val Ser His Tyr Asp Thr Ala Gly Leu Val Gin Thr Asp 
165 170 175 

ser He Ala Leu Ser Gly Val Pro Leu Ala Val Thr Arg Gin Leu Leu 
180 185 190 

Pro Asp Ala Ala Gly Ala Asn Trp Met Gly Glu Asp Ala Ser Ala Trp 
195 200 205 

Asn Asp Leu Leu Asp Gly Glu Thr Phe Phe Thr Gin Thr His Ala Asp 
2\0 215 220 

Ala Thr Gly Ala val Leu Ser He Thr Asp Ala Lye Gly Asn Leu Gin 
225 230 235 240 
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Arg val Ala Tyr Asp Val Ala Gly Leu Leu Ser Gly Ser Trp Leu Thr 
245 250 255 

Leu Lys Asp Gly Thr Glu Gin Val He Val Ala Ser Leu Thr Tyr Ser 
260 265 210 

Ala Ala Gly Lys Lys Leu Arg Glu Glu His Gly Asn Gly Val Val Thr 
275 280 285 

ser Tyr He Tyr Glu Pro Glu Thr Gin Arg Leu Thr Gly He Lys Thr 
290 300 

Glu Arg Pro Ser Gly His Val Ala Gly Ala Lys Val Leu Gin Asp Leu 
305 310 315 320 

Arg Tyr Thr Tyr Asp Pro Val Gly Asn Val Leu Ser Val Asn Asn Asp 
325 330 335 

Ala Glu Glu Thr Arg Phe Trp Arg Asn Gin Lys Val Val Pro Glu Asn 
340 345 350 

Thr Tyr He Tyr Asp Ser Leu Tyr Gin Leu Val Ser Ala Thr Gly Arg 
355 360 365 

Glu Met Ala Asn Ala Gly Oln Gin Gly Asn Asp Leu Pro Ser Ala Thr 
370 375 380 

Ala Pro Leu Pro Thr Asp Ser Ser Ala Tyr Thr Asn Tyr Thr Arg Thr 
385 390 395 400 

Tyr Arg Tyr Asp Arg Gly Gly Asn Leu Thr Oln Met Arg His Ser Ala 
405 410 415 

Pro Ala Thr Asn Asn Asn Tyr Thr Thr Asp He Thr Val Ser Asp Arg 
420 425 430 

Ser ^n Arg Ala Val Leu Ser Thr Leu Ala Glu Val Pro Ser Asp Val 
435 440 445 

ASP Met Leu Phe Ser Ala Gly Gly His Oln Lys His Leu Gin Pro Gly 
450 455 460 

Gin Ala Leu Val Trp Thr Pro Arg Gly Glu Leu Oln Lys Val Thr Pro 
465 470 475 480 

val Val Arg Asp Gly Gly Ala Asp Asp Ser Glu Ser Tyr Arg Tyr Asp 
485 490 495 

Ala Gly ser Gin Arg He He Lys Thr Gly Thr Arg Gin Thr Oly Asn 
500 505 510 

Asn val Gin Thr Gin Arg Val Val Tyr Leu Pro Gly Leu Glu Leu Arg 
515 520 525 

He Met Ala Asn Gly Val Thr Glu Lys Glu Ser Leu Oln Val He Thr 
530 535 540 

Val Gly Glu Ala Gly Arg Ala Oln Val Arg Val Leu Hie Trp Glu He 
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545 550 555 560 

Gly Lys Pro Asp Asp Leu Asp Glu Asp Ser Val Arg Tyr Ser Tyr Asp 
565 570 575 

Asn Leu Val Gly Ser Ser Gin Leu Glu Leu Asp Arg Glu Gly Tyr Leu 
580 585 590 

lie Ser Glu Glu Glu Phe Tyr Pro Tyr Gly Gly Thr Ala Val Leu Thr 
595 600 605 

Ala Arg Ser Glu Val Glu Ala Asp Tyr Lys Thr lie Arg Tyr Ser Gly 
610 615 620 

Lvs Glu Arg Asp Ala Thr Gly Leu Asp Tyr Tyr Gly Tyr Arg Tyr Tyr 
625 630 635 640 

Gin Pro Trp Ala Gly Arg Trp Leu Ser Thr Asp Pro Ala Gly Thr Val 
645 650 655 

ASP Gly Leu Asn Leu Phe Arg Met Val Arg Asn Asn Pro Val Thr Leu 
660 665 670 

Phe Asp ser Asn Gly Arg lie Ser Thr Gly Gin Glu Ala Atg Arg Leu 
675 680 685 

Val Gly Glu Ala Phe Val His Pro Leu His Met Pro Val Phe Glu Arg 
690 695 700 

He Ser Val Glu Arg Lys He Ser Met Ser Val Arg Glu Ala Gly He 
705 710 715 720 

Tvr Thr He Ser Ala Leu Gly Glu Gly Ala Ala Ala Lys Gly His Asn 
725 730 735 

He Leu Glu Lye Thr He Lys Pro Gly Ser Leu Lys Ala He Tyr Gly 
740 745 750 

ASP LYB Ala Glu Ser He Leu Gly Leu Ala Lys Arg Ser Gly Leu VaX 
^ 755 760 765 

Gly Arg Val Gly Gin Trp Asp Ala Ser Gly Val Arg Gly He Tyr Ala 
770 775 ''80 

His Asn Aig Pro Gly Gly Glu Asp I^u Val Tyr Pro Val Ser Leu Gin 
785 790 795 600 

Asn Thr Ser Ala Asn Glu He Val Asn Ala Trp He Lys Phe Lys He 
805 «l-0 815 

He Thr Pro Tyr Thr Gly Asp Tyr Asp Met His Asp He He Lys Phe 
820 825 830 

ser ASP Gly Lys Gly His Val Pro Thr Ala Glu Ser Ser Glu Glu Arg 
835 840 845 

Gly val Lys Asp Leu He Asn Lys Gly Val Ala Glu Val Asp Pro Ser 
850 855 860 
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Arg Pro Phe Glu Tyr Thr Ala Met Asn Val He Arg His Gly Pro Gin 
865 870 875 880 

val Asn Phe Val Pro Tyr Met Trp Glu His Glu His Aep Lys Val Val 
885 890 695 

Asn PiBp Asn Gly Tyr Leu Gly Val Val Ala Ser Pro Gly Pro Phe Pro 
900 905 910 

Val Ala Met Val His Gin Gly Glu Trp Thr VaX Phe Asp Asn Ser Glu 
915 920 925 

Glu Leu Phe Asn Phe Tyr Lys Ser Thr Asn Thr Pro Leu Pro Glu His 
930 935 940 

Trp Ser Gin Asp Phe Met Asp Arg Gly Lys Gly He Val Ala Thr Pro 
945 950 955 960 

Arg His Ala Glu Leu Leu Asp Lys Arg Arg Val Met Tyr 



965 



970 
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ggatccgagt gaaggaatca tcggccgctt tatacgttitic agggtgaata cggttggccg 60 
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aattgattgt ttctgaaaaa attaattgca cctctgccac ttatcagata aaaacacccc 660 
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Present claim 17 relates to a ligand defined by reference to a desirable 
characteristic or property, namely binding to the polypeptide of claim 
15- 
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reference to a result to be achieved. 
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international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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The present applicant has now found that three regions of the pADAP plasmid are required 
for full insecticidal function. Sequence analysis of these three regions has shown that the 
present applicant has isolated and identified a novel toxin from Serratia species that 
5 belongs to a new family of insecticidal toxins. It is broadly to this toxin that the present 
invention is directed. 

Disclosure of Invention 

According to a first aspect of the present invention, there is provided an isolated nucleic 
acid molecule comprising a nucleotide sequence of SEQ ED NO: 1 which encodes an 
10 insecticidal protein complex, or a functional fragment, neutral mutation, or homolog 
thereof capable of hybridising with said nucleic acid molecule under standard hybridisation 
conditions. 

The invention also provides an isolated nucleic acid molecule comprising the nucleotide 
sequence 1995-18937 of SEQ ID NO: 1 which encodes an insecticidal protein complex, or 
15 a functional fragment, neutral mutation, or homolog thereof capable of hybridising with 
said nucleic acid molecule under standard hybridisation conditions. 

The invention also provides an isolated nucleic acid molecule comprising one or more of 
the nucleotide sequences 241 1-9547, 9589-13883 or 14546-17467 of SEQ ID NO: 1 which 
encode insecticidal proteins, or a functional fragment, neutral mutation, or homolog thereof 
20 capable of hybridising with said nucleic acid molecule under standard hybridisation 
conditions. 

Preferably the nucleic acid molecule comprises all of nucleotide sequences 2411-9547, 
9598-13884 and 14546-17467 of SEQ ID NO: 1. 
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The invention furthe^Wates to an isolated nucleic acid molecule comprising a sequence of 
SEQ ID NO: 1, nucleotides 1955-18937 of SEQ ID NO: 1 or one or more of nucleotides 



further nucleotide sequence which encode an insecticidal protein. For example, the at least 
5 one further nucleotide sequence may be the nucleotide sequence which codes for the 
Bacillus delta endo toxins, vegatative insecticidal proteins (vips), cholesterol oxidases, 
Clostridium bifermentens mosquitocidal toxins and/or Photcrhabadus luminescens toxins 
and so forth. 

The nucleic acid molecule may comprise DNA, cDNA or RNA. 

10 Preferably said fragment, neutral mutation or homolog thereof is capable of hybridising to 
said nucleic acid molecule under stringent hybridisation conditions. 



The invention further relates to nucleic acid molecules which hybridise to the nucleotide 
sequence of SEQ ID NO: 1, or nucleotides 1955-18937, 241 1-9547, 9598-13884 or 14546- 
17467 of SEQ ID NO: 1 if there is at least 50%, preferably 60%, more preferably 70% and 
15 most preferably 90-95% or greater identity between the sequences. 

The nucleic acid molecule may be isolated from Serratia entomophila or Serratia 
proteamaculans strains. 

Also provided by the present invention are recombinant expresision vectors containing the 
nucleic acid molecule of the invention and hosts transformed with the vector of the 
20 invention capable of expressing a polypeptide of the invention. 

The vector may be selected from any suitable natural or artificial plasmid/vector. For 
example, pUC 19 (Yannish-Perron et al. 1995), pProEX HT (GibcoBRL, Gaithersburg, 
MD, USA), pBR322 (Bolivar et al. 1977), pACYC184 (Chang et aL 1978), pLAFR3 
(Staskowicz et al. 1987), and so forth, 



241 1-9547, 9598-13884 or 14546-17467 of SEQ ID NO: 1, operably linked to at least one 
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The Claims Defining the Invention are; 

1 . A purified and isolated nucleic acid molecule comprising a nucleotide sequence of SEQ ID 
NO: 1 that encodes at least one of: 

(i) an insecticidai protein complex, or 

(ii) a functional fragment of said complex, or 

(iii) a neutral mutation of said complex, or 

(iv) a homolog of said complex, 

each of which are capable of hybridising with said nucleic acid molecule under standard 
hybridisation conditions. 

2. A purified and isolated nucleic acid molecule as claimed in Claim 1 comprising the 
nucleotide sequence 1995- 1 8937 of SEQ ID NO: 1 . 

3. A purified and isolated nucleic acid molecule as claimed in Claim 1 comprising one or 
more of the nucleotide sequences 241 1-9547, 9589-13883 or 14546-17467 of SEQ ID NO: 
1. 

4. A purified and isolated nucleic acid molecule as claimed in Claim 3 comprising all of 
nucleotide sequences 241 1-9547, 9598-13884 and 14546-17467 of SEQ ID NO: 1. 

5. A purified and isolated nucleic acid molecule as claimed in Claim 1 comprising a sequence 
of SEQ ID NO: 1, operably linked to at least one further nucleotide sequence which 
encode an insecticidai protein. 

6. A purified and isolated nucleic acid molecule as claimed in Claim 2 comprising 
nucleotides 1955-18937 of SEQ ID NO: 1, operably linked to at least one further 
nucleotide sequence which encode an insecticidai protein. 
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7. A purified and isolated nucleic acid molecule as claimed in Claim 3 comprising a sequence 
of SEQ ID NO: l,or one or more of nucleotides 241 1-9547, 9598-13884 or 14546-17467 
of SEQ ID NO: 1, operably linked to at least one further nucleotide sequence which 
encode an insecticidal protein. 

8. A purified and isolated nucleic acid molecule as claimed in any one of claims 4 through 6 
wherein the said nucleotide sequence includes the nucleotide sequence which codes for at 
least one of the Bacillus delta endo toxins, vegatative insecticidal proteins (vips), 

„ cholesterol oxidases, Clostridium bifermentens mosquitocidal toxins and/or 
Photorhabadus luminescens toxins. 

9. A purified aVid isolated nucleic acid molecule as claimed in claim 1 wherein nucleic acid 
molecule may comprise DNA, cDNA or RNA, 

10. A purified and isolated nucleic acid molecule as claimed in claim 1 wherein the nucleic 
acid molecules said fragment, neutral mutation or homolog thereof capable of hybridising 
to said nucleic acid molecule, hybridise to the nucleotide sequence of SEQ ID NO: 1, or 
nucleotides 1955-18937, 2411-9547, 9598-13884 or 14546-17467 of SEQ ID NO: 1 if 
there is at least 50%, preferably 60%, more preferably 70% and most preferably 90-95% or 
greater identity between the sequences. 

11. A purified and isolated nucleic acid molecule as claimed in claim 1 wherein the nucleic 
acid molecule may be isolated from Serratia entomophila or Serratia proteamaculans 
strains of bacteria. 

12. A recombinant expression vector(s) containing the nucleic acid molecule as claimed in 
Claim 1 and host transformed with the vector expressing a polypeptide. 

13. A recombinant expression vector(s) as claimed in claim 1 1 wherein the vector is selectable 
from any suitable natural or artificial plasmid/vector. 

14. A recombinant expression vector(s) as claimed in claim 13 wherein said suitable natural or 
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artificial plasmid/vector, including, pUC 19 (Yannish-Perrbn et al. 1995), pProEX HT 
(GibcoBRL, Gaithersburg, MD, USA), pBR322 (Bolivar et al. 1977), pACYC184 (Chang 
et al. 1 978), pLAFR3 (Staskowicz et al. 1987). 

15. A polypeptide resulting from the transformation or transfection of a host cell with a 
recombinant expression vector as claimed in any one of Claims 12 through 14. 

1 6. A method of producing a polypeptide of claim 1 5 comprising the steps of: 

(a) culturing a host cell which has been transformed or transfected \yith said vector as 
defined above to express the encoded polypeptide or peptide; and 

(b) recovering the expressed polypeptide or peptide. 

1 7. A ligand that binds to a polypeptide of Claim 15. 

18. A ligand as claimed in claim 17 wherein" the ligand is an antibody or antibody binding 
fragment. ^ 

19. Probes and primers comprising a fragment of the nucleic acid molecule as claimed in 
Claim 1 wherein said fragment is hybridisable under stringent conditions to a native 
insecticidal gene sequence. 

20. Probes and primers comprising a fragment of the nucleic acid molecule as claimed in 
claim 19 wherein said probes and primers enable the structure and function of the gene to 
be determined and homologs of the gene to be obtained from bacteria other than Serratia 
sp. 

21. A polypeptide as claimed in Claim 15 wherein the polypeptide has insecticidal activity 
encoded by the nucleic acid molecule of claim 1, or a functional fragment, neutral 
mutation or homolog thereof. 

22. A polypeptide having insecticidal activity as claimed in claim 21 wherein the polypeptide 



